FOREWORD

The  present  Monograph  is  based  on  an  earlier  Memorandum  produced  by 
the  Directorate  of  Ordnance  Factories  (Explosives)  for  the  use,  primarily,  of 
those concerned with pilot plant and plant scale experiments on chemical manufacturing
processes in the Royal Ordnance Factories (Explosives). Much work
of  this  type  was  being  carried  out  and  it  had  become  evident  that  it  was  desirable 
for  the  results  of  such  experiments  to  be  subjected  to  critical  tests  of  significance. 
A  convenient  account  of  the  straightforward  tests  of  significance,  written  from 
the  point  of  view  of  the  individual  who  has  to  apply  them  in  practice  without 
necessarily  a  full  knowledge  of  their  theoretical  background,  was  not  readily 
available,  and  an  attempt  was  therefore  made  to  prepare  one. 

It  was  evident  that  to  apply  tests  of  significance  conveniently  and  economically 
the  experiments  had  to  be  planned  in  appropriate  forms.  It  is  considered  that 
the methods outlined should be as much a standard tool of the industrial experimenter
as a chemical balance is of the laboratory experimenter. In carrying out
an  industrial  experiment  the  choice  is  not  between  using  a  statistical  design  with 
the  application  of  the  appropriate  tests  of  significance  or  the  ordinary  methods  : 
the  choice  is  between  correct  or  incorrect  methods.  Even  the  simplest  experiment 
requires  an  estimate  of  the  significance  of  its  results. 

The  practice  sometimes  followed  of  consulting  the  statistician  only  after  the 
experiment  is  completed  and  asking  him  "what  he  can  make  of  the  results"  cannot 
be  too  strongly  condemned.  It  is  essential  to  have  the  experiment  in  a  form 
suitable  for  analysis  and  in  general  this  can  only  be  attained  by  designing  the 
experiment  in  consultation  with  the  statistician,  or  with  due  regard  to  the 
statistical  principles  involved. 

The present Monograph, therefore, is intended to be a guide to both the
planning and the interpretation of experiments on the industrial scale, and it is
hoped that the methods described will become part of the everyday technique
of those who carry out such experiments.

R. C. BOWDEN,

Director  of  Ordnance  Factories  (Explosives) 
Ministry  of  Supply 
FROM THE PREFACE TO THE FIRST EDITION

The  present  monograph  is  intended  to  provide  for  those  who  carry  out 
investigational  work  a  guide  to  modern  statistical  methods,  both  the  use  of  tests 
of  significance  to  attain  reliability  in  deductions  from  experimental  data  and  the 
use  of  statistical  design  to  attain  the  maximum  precision  with  the  minimum 
expenditure. 

The subject is treated entirely from the practical point of view.(1) Theory
is at a minimum and the only mathematics involved is simple arithmetic. Each
statistical method discussed is illustrated with practical examples worked out in
detail  to  show  what  exactly  is  involved  in  its  use.  Some  of  the  more  advanced 
techniques  may  appear  rather  formidable,  but  it  is  strongly  recommended  that 
they  should  be  approached  through  the  use  of  the  simpler  methods.  A  little 
practice  in  these  will  lead  to  a  familiarity  and  confidence  with  which  it  will  be 
realized  that  the  advanced  techniques  are  merely  comparatively  easy  extensions 
of  the  simpler. 

The  theoretical  background  on  which  the  author  has  drawn  is  of  course  that 
largely  developed  by  Professor  R.  A.  Fisher,  expounded  in  his  "Statistical  Methods 
for  Research  Workers"  and  "The  Design  of  Experiments." 

Indebtedness  is  expressed  to  Prof.  R.  A.  Fisher  and  Dr.  Frank  A.  Yates  for 
permission  to  reprint  Tables  III-VI,  XII  and  XXIII  from  their  book  "Statistical 
Tables  for  Biological,  Agricultural,  and  Medical  Research"  (Oliver  &  Boyd, 
Edinburgh  and  London),  and  to  the  British  Standards  Institution  for  permission 
to  reprint  certain  factors  of  Quality  Control  Charts  from  B.S.  600R,  "Quality 
Control  Charts." 

The  examples  used  here  are  all  the  results  of  experimental  and  investigational 
work  in  the  Royal  Ordnance  Factories  (Explosives),  and  the  author's  thanks  are 
due  to  Mr.  A.  C.  H.  Cairns,  Mr.  A.  H.  Woodhead,  Mr.  D.  Newman  and  others 
whose  collaboration  has  been  invaluable  in  the  development  of  the  ideas  and 
outlook  presented  here.  The  author  is  also  indebted  to  Dr.  R.  C.  Bowden, 
Director of Ordnance Factories (Explosives), for his active interest and encouragement
throughout this work.

K.  A.  B. 

PREFACE  TO  THE  SECOND  EDITION 

Chapter I has been extended by sections on Youden Squares and on Randomisation.
In Chapter IX (g) the comparison of regression lines is discussed at
greater length. Chapter XII (g) has a numerical example of a Latin Square.
The methods of computation for Youden Squares and some examples of them
are given in Chapter XIII (e). Chapter XIV is extended by discussions of confounding
with factors at three levels (Section (g)) and four levels (Section (h)).
Tables  of  the  angular  transformation  (VI)  and  of  random  numbers  (VII)  are 
added  to  the  Appendix.     A  number  of  minor  additions  have  also  been  made. 

K.  A.  B. 


(1) The reader will find the mathematical theory of the simple statistical tests well dealt
with by A. Linder : "Statistische Methoden für Naturwissenschafter, Mediziner, und
Ingenieure" (Verlag Birkhäuser, Basel). Attractive treatments which do not make excessive
demands  on  the  reader's  mathematical  knowledge  are  Paul  G.  Hoel,  "Introduction  to 
Mathematical  Statistics"  (John  Wiley  :  Chapman  and  Hall,  1947)  and  C.  E.  Weatherburn, 
"A  First  Course  in  Mathematical  Statistics"  (Cambridge  University  Press,  1946).  Full 
and  comprehensive  texts  are  M.  G.  Kendall,  "The  Advanced  Theory  of  Statistics"  (Griffin  : 
Volume  I,  1943,  Volume  II,  1946),  and  S.  S.  Wilks,  "Mathematical  Statistics"  (Princeton 
University  Press,  1946). 
CHAPTER I

INTRODUCTION 

(a)  Experimental  Error 

In  all  experimental  work  our  observations  are  always  subject  to  experimental 
error.  No  experimenter  would  seriously  dispute  this  statement.  This  makes  it 
all  the  more  remarkable  that  only  recently  has  it  begun  to  be  generally  admitted 
that  in  order  to  judge  the  trustworthiness  of  any  experimental  result  it  must  be 
compared with an estimate of its error, an operation known to the statistician as
a "test of significance." It can be said at once that the lack of this test of significance
is the main origin of the unsatisfactory nature of most industrial experimentation :
the errors are large, larger than is realised, and the apparent effects,
attributed  by  the  experimenter  to  such  factors  as  he  has  varied,  are  in  reality 
arising  solely  through  the  accidental  fluctuations  due  to  these  errors.  If  rigorous 
tests  of  significance  were  applied,  many  of  the  assertions  made  would  be  shown 
to  be  based  on  quite  inadequate  evidence  and  many  would  also  be  found  to  be 
fallacious  (the  fact  that  an  assertion  has  been  made  on  inadequate  evidence  of 
course  does  not  imply  that  that  assertion  is  incorrect :  it  may  turn  out  that  the 
asserter  was  lucky  in  his  gamble.  However,  the  plant  manager  in  taking  the 
advice  of  his  experimenter  does  want  to  know  whether  he  is  acting  on  a  scientifically 
proved  fact  or  on  a  subjective  guess). 

(b)  Classical  and  Industrial  Experimentation 

In  this  question  of  the  importance  of  error  we  have  one  of  the  principal 
differences  between  laboratory  and  industrial,  or  plant  scale,  experimentation. 
The  laboratory  experimenter  is  generally  in  the  fortunate  position  of  being  able 
to  have  all  his  independent  variables  under  complete  control :  his  materials  can 
be  of  the  highest  possible  purity  :  his  measuring  apparatus  can  all  be  of  the  highest 
accuracy,  and  usually  his  errors  are  small. 

In  referring  above  to  the  laboratory  experimenter,  we  meant  particularly 
those  working  on  physical-chemical  systems.  Any  work  involving  living  matter, 
bacteria,  fungi,  etc.,  is  always  found  to  have  relatively  large  variability  outside 
the  control  of  the  experimenter,  and  as  far  as  the  magnitude  of  his  experimental 
error  is  concerned  he  is  more  in  the  position  of  the  industrial  experimenter  even 
if  he  is  working  on  the  laboratory  scale. 

The  industrial  experimenter,  on  the  other  hand,  often  cannot  obtain  complete 
control  of  all  his  variables,  for  frequently  the  scale  of  the  experiment  is  so  vast 
that  an  army  of  supervisors  would  be  required  to  keep  them  under  control.  The 
attainment  of  good  control  may  also  be  a  difficult  technical  matter :  thus  it  is 
generally  easy  in  a  well-equipped  laboratory  to  control  temperatures  within  very 
fine  limits  with  the  usual  thermostatic  equipment.  For  a  plant  scale  experiment 
this  is  generally  quite  impracticable.  Again,  it  is  easy  enough  to  provide  excellent 
lagging  for  any  laboratory  apparatus :  to  lag  an  absorption  column  50  feet  high, 
on  the  other  hand,  may  be  absurdly  expensive. 

Further,  for  the  industrial  experimenter  it  may  be  in  practice  essential  to 
carry  out  the  investigation  with  only  the  slightest  hindrance  to  normal  production. 
Thus  it  might  be  possible  to  obtain  control  of  the  independent  variables,  but  only 
through  taking  such  time  and  care  on  each  batch  that  the  throughput  through  the 
plant would be seriously reduced. Or we might wish to vary some of the independent
variables over a wide range, in order to get points spaced well apart, so
that their effect on the dependent variable would show up clearly : this might
lead to the production of a large amount of out-of-specification scrap material.
It would thus be necessary to work in a narrow range of the independent variable,
which means that the magnitudes of the effects that we have to observe will be
much reduced and may well be swamped by errors.

These  factors  result  in  industrial  experiments  generally  having  a  much  larger 
error  than  laboratory  work.  Under  these  conditions  it  is  difficult  to  decide 
whether  a  particular  result  is  genuine  or  due  to  the  error,  and  this  calls  for  a 
statistical  test  of  significance.  Further,  in  order  to  get  a  significant  result  (i.e.  a 
result  in  which  the  apparent  effect  is  significantly  greater  than  the  error  in  its 
determination)  we  need  a  larger  number  of  experiments  (and  this  is  precisely 
what  is  so  undesirable  in  industrial  work  on  account  of  the  expense)  unless  we 
use  a  number  of  special  statistical  devices. 

Finally,  it  might  not  be  out  of  place  to  remark  that  it  has  been  the  author's 
experience  that  even  in  laboratory  work  the  errors,  when  calculated  objectively 
with  exact  statistical  technique,  are  often  much  larger  than  the  experimenter 
believes  or  claims  them  to  be. 

It  is  clear  from  the  above  considerations  that  the  industrial  experimenter  has 
to  face  a  set  of  problems  additional  to  those  of  the  laboratory  experimenter,  and 
hence  requires  an  additional  set  of  techniques.  These  have  been  developed  in 
recent  years  by  research  workers  in  agriculture  (notably  R.  A.  Fisher  and  his 
colleagues  at  the  Rothamsted  Experiment  Station)  who  have  to  contend  with  the 
weather  and  with  variations  in  the  fertility  of  the  soil  amongst  the  variables  outside 
their  control. 

(c)  Replication 

If  the  error  of  our  estimate  of  an  effect  is  too  great,  generally  there  are  two 
alternative  methods  of  reducing  it.  One  is  to  refine  the  technique,  e.g.  by  using 
more  precise  thermostatic  equipment,  weighing  on  better  balances,  taking  more 
meticulous  precaution  against  evaporation,  etc.  Alternatively,  and  often  on  the 
industrial  scale  this  is  the  only  alternative,  we  can  replicate  (repeat)  the  experiment 
a  number  of  times  and  take  the  average  of  the  results.  Unfortunately  this  process 
is  not  very  efficient,  for  the  error  of  the  average  is  inversely  proportional  to  the 
square  root  of  the  number  of  observations.  Thus  averaging  four  observations 
brings  the  error  of  the  average  down  to  one-half  that  of  a  single  observation  : 
the  average  of  sixteen  observations  has  one-quarter  of  the  error  of  a  single 
observation. 
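
The square-root rule is easily verified by simple arithmetic. The following is a minimal sketch in Python; the function name `error_of_mean` and the choice of a unit error for a single observation are illustrative assumptions and do not come from the text. It assumes the observations are independent and subject to the same error.

```python
import math

def error_of_mean(single_error, n):
    """Error of the average of n independent observations,
    each subject to the same error `single_error`."""
    return single_error / math.sqrt(n)

# Error of the average relative to that of a single observation (taken as 1.0).
for n in (1, 4, 16):
    print(n, error_of_mean(1.0, n))   # 1 -> 1.0, 4 -> 0.5, 16 -> 0.25
```

Put the other way round, to halve the error by replication alone the number of observations must be multiplied by four.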

On  the  laboratory  scale,  experiments  are  relatively  cheap,  and  we  can  afford 
to  take  many  more  observations  than  are  strictly  necessary  to  establish  a  given 
conclusion  with  a  given  level  of  significance.  We  can  easily  take  so  many  that 
the  error  of  the  average,  if  it  ever  was  very  high,  has  become  so  small  that  an 
exact  test  of  significance  is  obviously  unnecessary.  On  the  industrial  scale, 
however,  the  work  is  so  expensive,  involving  a  large  consumption  of  supervisory 
man-power,  raw  materials,  power,  etc.,  that  to  take  more  observations  than  would 
suffice  for  our  purpose,  defined  above  as  the  establishing  of  a  given  conclusion 
with a given level of significance, would be scandalously extravagant. (*)

(d)  Experimental  Design  :   Randomised  Blocks 

Statistical  theory  has  so  far  only  been  mentioned  in  connection  with  the 
theory  of  errors,  to  which  of  course  it  is  fundamental.  It  has,  however,  a  great 
deal  to  say  on  how  experiments  should  be  designed  so  as  to  allow  the  obtaining 
of  an  unbiassed  estimate  of  error  and  to  allow  the  error  to  be  at  a  minimum. 

Let  us  start  with  one  of  the  simplest  conceptions  in  experimental  design,  the 
so-called  "Randomised  Block." 

(*)  This  aspect  of  statistics  has  been  developed  considerably  recently  :  c.f.  "Sequential 
Analysis  of  Statistical  Data :  Applications" :  Statistical  Research  Group,  Columbia 
University  :  Columbia  University  Press,  and  G.  A.  Barnard  :  "Sequential  Tests  in  Industrial 
Statistics"  (Supplement  to  the  Journal  of  the  Royal  Statistical  Society,  Vol.  VIII,  No.  1, 
p.  1,  1946). 
Suppose we wish to compare four "treatments." We are using "treatment"
in  a  generalised  sense  :  the  four  treatments  may  e.g.  be  literally  four  fertiliser 
treatments  for  wheat,  or  four  diets  for  rats,  or  four  temperatures  for  a  nitration 
process,  or  four  concentrations  of  plasticiser  in  a  plastic.  Further  suppose  that 
the  material  to  which  the  treatments  are  applied  arrives  in  batches  large  enough 
to  allow  only  four  experiments  per  batch.  We  will  suppose  that  it  is  probable 
that  successive  batches  differ  appreciably  in  their  quality.  We  will  call  these 
batches  by  the  general  term  "blocks."  In  the  wheat  example  quoted  above,  four 
adjacent  plots  of  ground  may  be  considered  as  the  "block"  :  they  are  more  likely 
to  resemble  each  other  in  their  fertility  than  other  plots  belonging  to  other  blocks 
which  will  be  further  away,  possibly  in  a  different  field.  In  the  case  of  the  rats, 
it  is  probable  that  rats  from  the  same  litter  resemble  each  other  more  than  rats 
from  other  litters.  In  the  case  of  the  nitration  process,  a  tank  full  of  well-stirred 
acid  and  a  batch  of  material  to  be  nitrated  mixed  thoroughly  till  it  is  homogeneous 
would  be  the  equivalent  of  the  "block."  Further  suppose  that  we  are  going  to 
replicate  four  times  (that  is,  carry  out  each  observation  four  times(2)).  If  the 
Treatments  are  A,  B,  C,  and  D  and  the  Blocks  are  1,  2,  3  and  4  we  could  execute 
the  experiment  as  in  Table  1.1. 

Table 1.1

Block     Treatment
  1       A   A   A   A
  2       B   B   B   B
  3       C   C   C   C
  4       D   D   D   D

A moment's thought will show that a scheme such as that in Table 1.1 is
obviously  absurd,  for  whatever  differences  show  up  are  as  likely  to  be  due  to 
differences  between  blocks  as  to  differences  between  treatments. 

A  slightly  better  procedure  would  be  to  distribute  the  treatments  completely 
at  random.  We  can  label  16  cards  A,  B,  C,  D  each  4  times,  place  them  in  a  hat, 
and  withdraw  them  one  by  one,  writing  them  down  in  order  from  left  to  right  in 
successive  blocks.     The  result  of  one  such  operation  is  in  Table  1.2. 

Table 1.2

Block     Treatments
  1       D   B   B   D
  2       C   D   A   B
  3       B   A   C   A
  4       C   C   D   A

If  we  take  the  averages  of  the  four  results  for  each  treatment  they  will  be 
statistically  sound  if  the  randomisation  was  properly  carried  out,  but  the  error  in 
each  mean  (average)  will  be  inflated  because  it  includes  in  itself  the  differences 
between  blocks.     The  experiment  will  thus  not  be  as  accurate  as  it  might  be. 
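
The drawing of cards from a hat can be imitated on a computer. The sketch below is only an illustration: the function name `completely_randomised` is hypothetical, and the fixed seed is an assumption made solely so that the example is repeatable. It will not, in general, reproduce the particular arrangement of Table 1.2.

```python
import random

def completely_randomised(treatments, replications, block_size, rng):
    """Shuffle all the cards together, then deal them out block by block."""
    cards = list(treatments) * replications   # 16 cards: A, B, C, D each 4 times
    rng.shuffle(cards)                        # the "hat"
    return [cards[i:i + block_size] for i in range(0, len(cards), block_size)]

rng = random.Random(1)  # fixed seed only so that the sketch is repeatable
layout = completely_randomised("ABCD", 4, 4, rng)
for block_number, block in enumerate(layout, start=1):
    print(block_number, " ".join(block))
```

Nothing in this procedure obliges a block to contain every treatment, which is why block differences find their way into the error of each mean.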

It  would  obviously  be  better  to  allocate  each  treatment  once  to  each  block, 
as  in  Table  1.3. 

(2)  It  will  be  convenient  if  we  decide  that  a  "single  replication"  means  an  experiment 
carried  out  once,  "double  replication"  means  an  experiment  carried  out  twice,  and  so  on. 
Table 1.3

Block     Treatment
  1       A   B   C   D
  2       A   B   C   D
  3       A   B   C   D
  4       A   B   C   D

When  we  take  the  average  of  the  four  A's,  and  compare  it  with  the  average 
of,  say,  the  four  B's,  the  difference  between  the  average  is  completely  independent 
of  any  block  differences. 

It might, however, be more satisfactory if the order of carrying out the experiments
were randomised (as they stand in Table 1.3 it is systematic, the order being
A, B, C, D in each block). If there was any tendency for the first experiment in
each  case  to  be  high,  then  this  would  appear  as  a  high  result  for  A  which  would 
be  fallacious.  Randomisation,  such  as  has  been  carried  out  in  Table  1.4,  would 
avoid  this. 

Table 1.4

Block     Treatment
  1       D   B   A   C
  2       B   C   A   D
  3       C   B   A   D
  4       A   C   D   B

We have now arrived at the concept of the "randomised block." All treatments
to be compared occur an equal number of times in each block, the block
being large enough to take all treatments at least once. The replication is achieved
by repeating the experiment on further blocks, the order of experiments always
being randomised. Of course, if the size of the block is such that several replications
can be placed in a single block, then this can be done and nothing is gained
by replicating in different blocks. (8)

The  analysis  of  the  results  of  such  an  experiment,  with  the  testing  of  the 
significance  of  the  results,  is  discussed  in  Chapter  VII  (e). 
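
A randomised block layout of the kind described in this section can be produced as follows. This is a minimal sketch under stated assumptions: the function name `randomised_blocks` is hypothetical and the seed is fixed only to make the example repeatable. Each treatment is placed once in every block and the order within each block is shuffled independently.

```python
import random

def randomised_blocks(treatments, n_blocks, rng):
    """One complete set of treatments per block, in a fresh random order each time."""
    layout = []
    for _ in range(n_blocks):
        order = list(treatments)
        rng.shuffle(order)        # randomise the order of execution within the block
        layout.append(order)
    return layout

rng = random.Random(7)  # fixed seed only so that the sketch is repeatable
layout = randomised_blocks("ABCD", 4, rng)
for block_number, block in enumerate(layout, start=1):
    print(block_number, " ".join(block))
```

Unlike the complete randomisation sketched earlier, the shuffling here is restricted to take place within blocks, so that every treatment average contains each block exactly once.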

(e)  The  Latin  Square 

The  hypothetical  experiment  discussed  in  the  previous  section,  though  now 
rigorous  in  design  in  that  it  cannot  lead  to  false  conclusions,  is  in  some  cases 
capable  of  further  refinement. 

To  particularise  slightly,  suppose  that  in  a  multi-stage  process  batches  of  a 
ton are split into four sub-batches which are processed simultaneously (approximately)
in four reactors, and we wish to compare the effects of four different
treatments  at  this  latter  stage.  With  the  four  treatments  on  the  four  sub-batches 
in  each  batch  we  have  the  randomised  block  technique,  the  allocation  of  the  four 
treatments  to  the  four  reactors  being  decided  by  randomisation.  It  is  possible, 
however,  to  arrange  that  with  such  treatment  each  of  the  four  reactors  occurs 
once  and  once  only,  so  that  the  treatment  averages  are  completely  independent 
of  any  possible  differences  between  reactors.  Such  an  arrangement  is  that  in 
Table  1.5,  where  the  letters  refer  to  the  treatments. 

(8)  See,  however,  the  end  of  Chapter  XII  (e). 
Table 1.5

Batch     Reactor 1   Reactor 2   Reactor 3   Reactor 4
  1           A           B           C           D
  2           C           D           A           B
  3           B           A           D           C
  4           D           C           B           A

Examining  Table  1.5,  it  will  be  seen  that  the  four  sub-batches  of  each  batch 
include  all  four  treatments.  At  the  same  time,  with  each  of  the  four  treatments 
all  four  reactors  occur  once  and  only  once  and  all  four  batches  occur  once  and 
only  once.  Possible  errors  due  to  differences  between  Batches  and  between 
Reactors  are  thus  eliminated  both  from  the  averages,  thus  allowing  unbiassed 
estimates  to  be  obtained,  and  also  from  the  error,  thus  making  the  latter  a  minimum. 

The  computation  of  the  results  of  a  Latin  Square  is  discussed  in  Chapter 
XII  (g). 
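
The defining property of the Latin Square, that every treatment occurs once and only once in each row and in each column, can be checked mechanically. The sketch below is illustrative only; the function name `is_latin_square` is hypothetical, and the rows are those of Table 1.5 (batches as rows, reactors as columns).

```python
def is_latin_square(square):
    """True if every symbol occurs exactly once in each row and each column."""
    n = len(square)
    symbols = set(square[0])
    if len(symbols) != n:
        return False
    if not all(len(row) == n and set(row) == symbols for row in square):
        return False
    return all({row[j] for row in square} == symbols for j in range(n))

table_1_5 = ["ABCD", "CDAB", "BADC", "DCBA"]   # batches 1 to 4
table_1_3 = ["ABCD", "ABCD", "ABCD", "ABCD"]   # systematic order, for contrast

print(is_latin_square(table_1_5))   # True
print(is_latin_square(table_1_3))   # False: each column holds one treatment only
```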

(f)  Balanced  Incomplete  Blocks 

In  the  ordinary  Randomised  Block  the  block  must  be  large  enough  to  take 
one  complete  set  of  treatments.  It  may  happen,  however,  that  we  wish  to  compare 
more  treatments  than  can  be  got  into  one  block. 

Suppose  we  have  raw  material  in  batches  that  are  only  large  enough  to  allow 
4  experiments  to  be  carried  out  on  one  batch,  and  further  suppose  that  there  is  an 
appreciable  variance  between  batches.  If  we  wish  to  carry  out  a  comparison  of 
2,  3,  or  4  treatments  this  between  batch  variance  would  not  worry  us,  as  each  set  of 
2,  3  or  4  treatments  could  be  carried  out  on  the  same  batch.  If,  however,  we 
wished  to  compare  5  or  more  treatments,  then  one  or  more  of  these  would  have 
to  be  carried  out  on  a  different  batch  from  that  on  which  the  first  4  treatments 
were  tested,  and  the  comparison  would  be  invalidated  by  the  confusing  element 
of  the  between  batch  variance.  The  standard  method  of  avoiding  this  difficulty 
is  to  select  one  of  the  treatments  as  a  "standard,"  and  include  it  in  each  batch. 
All treatments are then compared directly with the standard, and of the other
comparisons some are direct, through the treatments occurring in the same block,
and others are indirect : from treatment M to the standard in its block, to the
standard in the other block concerned, to treatment N.

Although  this  procedure  does  work  it  is  by  no  means  very  satisfactory,  as 
there  are  three  different  errors  for  the  three  different  types  of  comparison.  Further, 
it  is  not  the  most  efficient  possible. 

Consider  the  alternative  design  in  Table  1.6  where  we  have  the  five  treatments 
replicated  four  times  in  five  batches  of  four. 

Table 1.6

Batch Number     1    2    3    4    5

                 A    A    A    A
                 B    B    B         B
                 C    C         C    C
                 D         D    D    D
                      E    E    E    E
There are a number of points about this design :

(a)  Each  treatment  occurs  once  and  only  once  in  4  of  the  5  batches. 

(b)  Any  specified  pair  of  treatments,  e.g.  A  and  B,  or  C  and  E,  occur  together 
in  3  of  the  5  batches.  Thus  direct  comparisons  between  B  and  D  are 
possible  in  Batches  1,  3  and  5. 

(c) Direct comparison between a specified pair of treatments is not possible
in the other 2 batches ; thus there is no D in Batch 2 and no B in Batch 4.
Nevertheless  a  very  satisfactory  comparison  can  be  made,  for  A,  C  and  E 
occur  in  both  Batches  2  and  4  and  we  can  in  effect  use  the  average  of 
these  three  as  a  "standard."  It  will  be  noted  that  to  use  the  average  of 
three  as  a  standard  is  more  satisfactory  than  using  a  single  individual. 
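
Points (a) and (b) can be confirmed by direct counting. In the sketch below the contents of each batch are an assumption: they are read from Table 1.6 on the understanding, consistent with point (c), that Batches 1 to 5 omit treatments E, D, C, B and A respectively. The variable names are illustrative only.

```python
from collections import Counter
from itertools import combinations

# Assumed reading of Table 1.6: each batch omits exactly one treatment.
batches = {1: "ABCD", 2: "ABCE", 3: "ABDE", 4: "ACDE", 5: "BCDE"}

# (a) number of batches in which each treatment occurs
replications = Counter(t for batch in batches.values() for t in batch)

# (b) number of batches in which each pair of treatments occurs together
together = Counter(
    frozenset(pair) for batch in batches.values() for pair in combinations(batch, 2)
)

# batches allowing a direct comparison of B with D
b_with_d = [n for n, batch in batches.items() if "B" in batch and "D" in batch]

print(dict(replications))        # every treatment: 4
print(set(together.values()))    # every pair: {3}
print(b_with_d)                  # [1, 3, 5]
```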

These  experimental  designs  are  known  as  balanced  incomplete  blocks.  They 
are  balanced  because  each  treatment  occurs  to  exactly  the  same  extent :  they  are 
incomplete  because  no  block  contains  the  full  number  of  treatments.  They  suffer 
from the restriction that balanced arrangements are not possible for all experimental
set-ups. Broadly speaking, if we fix the number of treatments that we
wish to compare, and if the number of experiments per batch (or "block") is also
fixed, then the number of replications of each treatment is thereby determined.
This  is  the  principal  disadvantage  of  these  designs  :  the  number  of  replications 
they  require  may  be  greater  than  we  think  are  necessary  to  attain  sufficient  accuracy. 

Some  of  the  designs  are  more  satisfactory  from  this  point  of  view  than  others. 
In  general,  for  a  given  number  of  treatments,  the  larger  the  block  we  can  use  the 
smaller  the  number  of  replications  that  are  required  to  establish  balance.  Thus 
if  we  have  21  treatments  to  be  compared,  then  working  with  blocks  of  3  we  require 
10 replications. Working with blocks of 4 we could use the scheme for 25 treatments
(duplicating 4 of the most important if we wished to bring the total up to 25)
which  would  require  8  replications.  Blocks  of  5  are  a  very  satisfactory  size  as 
only  5  replications  are  required.  It  is  a  curious  property  that  blocks  of  6  and  7 
are less desirable, requiring 8 and 10 replications respectively. The exact requirements
in any particular instance are a consequence of the combinatorial properties
of  numbers. 
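
The figures quoted above can be recovered from simple counting. If v treatments are arranged in b blocks of k, each treatment being replicated r times and each pair of treatments occurring together in the same number, say m, of blocks, then b × k = v × r and r × (k - 1) = m × (v - 1) ; in addition b cannot be less than v (Fisher's inequality). The sketch below searches for the smallest r meeting these conditions. It is offered with two cautions : the function name and the symbols v, k, r, b and m are illustrative assumptions, not notation from the text, and the conditions are necessary but not sufficient, so that a design with the numbers found need not always exist.

```python
def minimum_replications(v, k, max_m=50):
    """Smallest r satisfying the counting conditions for a balanced
    incomplete block design with v treatments in blocks of k.
    Returns (r, b, m), or None if nothing is found up to max_m."""
    for m in range(1, max_m + 1):
        if (m * (v - 1)) % (k - 1):
            continue                    # r would not be a whole number
        r = m * (v - 1) // (k - 1)
        if (v * r) % k:
            continue                    # b would not be a whole number
        b = v * r // k
        if b >= v:                      # Fisher's inequality
            return r, b, m
    return None

for v, k in [(21, 3), (25, 4), (21, 5), (21, 6), (21, 7), (5, 4)]:
    print(v, k, minimum_replications(v, k))
```

For 21 treatments this gives 10, 5, 8 and 10 replications for blocks of 3, 5, 6 and 7, and 8 replications for 25 treatments in blocks of 4, in agreement with the figures above ; the last case, 5 treatments in blocks of 4, gives the 4 replications in 5 batches of Table 1.6.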

The  computation  for  these  designs  is  discussed  in  Chapter  XIII. 


(g)  Youden  Squares 

Let  us  go  back  to  the  example  used  in  Section  (e)  on  the  Latin  Square  ;  there 
we  had  each  batch  being  split  into  four  sub-batches,  each  to  be  used  in  four 
reactors.  Using  the  Latin  Square  layout,  it  was  possible  to  have  each  of  the  four 
treatments  that  we  wished  to  compare,  arranged  in  such  a  way  that  they  all  occurred 
once  and  only  once  with  each  batch  and  also  once  and  only  once  with  each  reactor. 
Batch differences and reactor differences were thereby eliminated from our comparison
of the treatment effects.

Now  suppose  that  the  number  of  treatments  to  be  compared  is  five  and  not 
four. We can no longer use the 4 X 4 Latin Square. If there were five sub-batches
to the batch and five reactors we could use a 5 X 5 Latin Square, but we
are  supposing  that  we  are  limited  to  four  sub-batches  per  batch. 

A  solution  is  to  adopt  the  Balanced  Incomplete  Block  design  outlined  in  the 
previous  section.  We  can  use  four  of  the  five  treatments  in  each  batch  in  the 
design  given  in  Table  1.6.  In  order  to  deal  with  the  uncontrolled  variability  due 
to  differences  between  reactors  we  must  allot  the  treatments  in  each  batch  to  the 
four  reactors  purely  at  random.  This  will  avoid  our  obtaining  faulty  conclusions, 
but  this  variability  due  to  differences  between  reactors  will  enter  into  the  residual 
error  against  which  we  compare  the  observed  effects.  The  precision  of  the 
experiment will, therefore, be reduced, and in order to obtain results of a given
limit  of  error  it  may  be  necessary  to  repeat  the  experiment  several  times. 
Incidentally, it will be obvious that should we know, or suspect, that the
variability  between  reactors  is  greater  than  that  between  batches,  then  it  would 
be  preferable  to  make  the  "blocks"  the  reactors  rather  than  the  batches,  i.e.  we 
would  carry  out  a  randomised  block  design,  with  all  five  treatments  occurring  in 
random  order  on  each  reactor.  The  variability  due  to  differences  between  batches 
would  then  enter  into  the  residual  error  and  therefore  inflate  it. 

It is possible, however, to do better than the arrangements of either of the
two  previous  paragraphs  and  remove  both  sources  of  variability,  batches  and 
reactors,  from  the  residual  error,  by  the  use  of  the  design  known  as  Youden's 
Square.  The  title  is  not  very  felicitous,  incidentally,  as  in  the  usual  manner  of 
representing  them  they  are  rectangular. 

Referring  to  Table  1.6,  a  little  thought  will  show  that  it  is  possible  to  re-arrange 
the  order  of  treatments  in  each  batch  so  that  each  row,  which  we  will  make 
correspond  to  reactor,  contains  all  five  treatments. 

Table 1.7

Batch Number     1    2    3    4    5

Reactor 1        A    E    D    C    B
Reactor 2        B    A    E    D    C
Reactor 3        C    B    A    E    D
Reactor 4        D    C    B    A    E

It is now possible to remove variability due both to batches and to reactors
from the residual error, and the experiment will therefore have the greatest possible
accuracy. In using a design such as that above, the treatments should be allocated
to the letters at random, the reactors should be allocated to the rows at
random, and the columns corresponding to the batches should be taken in a
random order.
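
The properties claimed for the Youden Square, and the three randomisations just described, can be sketched as follows. The function names `is_youden` and `randomise` are hypothetical, the rows are those of the Youden Square tabulated above (reactors as rows, batches as columns), and the seed is fixed only so that the example is repeatable.

```python
import random
from collections import Counter
from itertools import combinations

YOUDEN = ["AEDCB", "BAEDC", "CBAED", "DCBAE"]   # reactors 1 to 4, batches 1 to 5

def is_youden(rows):
    """Every row holds all treatments, no column repeats a treatment,
    and every pair of treatments meets in the same number of columns."""
    treatments = set(rows[0])
    columns = list(zip(*rows))
    rows_ok = all(len(r) == len(treatments) and set(r) == treatments for r in rows)
    cols_ok = all(len(set(c)) == len(c) for c in columns)
    pairs = Counter(frozenset(p) for c in columns for p in combinations(c, 2))
    n_pairs = len(treatments) * (len(treatments) - 1) // 2
    balanced = len(pairs) == n_pairs and len(set(pairs.values())) == 1
    return rows_ok and cols_ok and balanced

def randomise(rows, rng):
    """Random allocation of treatments to letters, reactors to rows,
    and a random order of the columns (batches)."""
    letters = sorted(set(rows[0]))
    relabel = dict(zip(letters, rng.sample(letters, len(letters))))
    row_order = rng.sample(range(len(rows)), len(rows))
    col_order = rng.sample(range(len(rows[0])), len(rows[0]))
    return ["".join(relabel[rows[i][j]] for j in col_order) for i in row_order]

rng = random.Random(3)  # fixed seed only so that the sketch is repeatable
plan = randomise(YOUDEN, rng)
print(is_youden(YOUDEN), is_youden(plan))   # True True
```

The randomisation merely renames and reorders ; it leaves the balance of the design untouched, which is why the check succeeds both before and after.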

Designs  of  this  type  are  only  possible  when  the  number  of  blocks  equals  the 
number  of  treatments.  This  restriction  is  not  so  severe  as  might  appear  at  first 
sight  as  many  of  the  designs  most  likely  to  be  used,  in  particular  those  requiring 
the  fewest  replications,  satisfy  this  condition.  Where  these  conditions  can  be 
satisfied  the  number  of  replications  is  equal  to  the  number  of  units  per  block. 
Thus  four  or  seven  treatments  can  be  compared  in  blocks  of  three  with  three 
replications,  either  five,  seven,  or  thirteen  treatments  in  blocks  of  four  with  four 
replications,  six  or  eleven  treatments  in  blocks  of  five  with  five  replications,  and 
so  on.     Several  examples  are  given  in  Chapter  XIII  (e). 
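
The combinations quoted above can be checked against a standard counting condition, which is not stated in the text: with v treatments in v blocks of k units, and hence k replications, each pair of treatments must occur together in k(k - 1)/(v - 1) blocks, so this quantity has to be a whole number. The sketch below (the function name lam is an invention for illustration) applies that check. The condition is necessary only; it does not by itself prove that a design exists.

```python
def lam(v, k):
    """Blocks shared by each pair of treatments, or None if not a whole number.

    Applies to v treatments arranged in v blocks of k units each.
    """
    quotient, remainder = divmod(k * (k - 1), v - 1)
    return quotient if remainder == 0 else None

# (treatments, units per block) as listed in the text
cases = [(4, 3), (7, 3), (5, 4), (7, 4), (13, 4), (6, 5), (11, 5)]
table = {(v, k): lam(v, k) for v, k in cases}
```

Every listed case passes, whereas, for example, eight treatments in eight blocks of three fails because 6/7 is not a whole number.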


(h)  Lattice  Squares 

The  outstanding  disadvantage  of  the  Balanced  Incomplete  Blocks  is  that  to 
obtain  balance  the  number  of  replications  necessary  becomes  rather  high  as  the 
number of treatments increases. Generally, $p^2$ treatments require $(p + 1)$
replications, e.g. 36 treatments require 7 replications, and this we may consider to
be more than is justified.
The Lattice Square is probably the most suitable device for this situation.
Suppose  we  have  25  treatments  which  we  wish  to  compare.  We  can  write  them 
down  in  the  form  of  a  square  as  in  Table  1.7. 

Table 1.7

11    12    13    14    15
21    22    23    24    25
31    32    33    34    35
41    42    43    44    45
51    52    53    54    55

These 25 treatments can be tested in blocks of 5 by taking first the rows as
blocks and then the columns as blocks. Repeating each set gives a total of 4
replications for each treatment. It will be noted that these designs are not
balanced. For example, treatment 11 occurs in the same blocks as treatments
12, 13, 14 and 15 and also 21, 31, 41 and 51, but never with the remaining 16
treatments. Satisfactory comparisons are nevertheless possible through
intermediates, and with the appropriate algebraic treatment the designs are very
efficient. (*)

The  chief  virtue  of  these  lattice  designs  is  that  the  number  of  replications 
required  is  independent  of  the  number  of  treatments.  Thus,  for  example,  to 
test  49  treatments  we  require  only  4  replications,  whereas  using  the  Balanced 
Incomplete  Block  technique  we  require  8  replications. 
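
The lack of balance, and the replication counts just quoted, can be reproduced with a few lines of code. This is a sketch for illustration: the function names are invented, and the two-digit labels follow the square of 25 treatments above, so it assumes p is at most 9.

```python
def lattice_blocks(p):
    """Rows, then columns, of a p x p square of treatment labels, each as a block."""
    square = [[10 * (i + 1) + (j + 1) for j in range(p)] for i in range(p)]
    rows = [set(r) for r in square]
    cols = [set(c) for c in zip(*square)]
    return rows + cols

def partners(treatment, blocks):
    """Treatments that share at least one block with the given treatment."""
    together = set().union(*(b for b in blocks if treatment in b))
    return together - {treatment}

def replications(p):
    """Replications quoted in the text for p*p treatments: (lattice, balanced)."""
    return 4, p + 1

blocks = lattice_blocks(5)
shared = partners(11, blocks)          # treatments that meet treatment 11
never = 5 * 5 - 1 - len(shared)        # treatments that never meet it
```

For the 5 by 5 square, treatment 11 meets eight others and never meets the remaining sixteen, as stated; replications(7) returns 4 against 8 for 49 treatments.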

(i)  The  Nature  of  "Blocks" 

We  have  been  discussing  in  the  previous  sections  devices  for  circumventing 
the  difficulties  arising  through  the  restricting  size  of  the  "blocks"  available.  The 
concept  of  block  is  an  extremely  general  one,  but  the  following  are  some  examples  : 

(a)  In  the  science  of  agriculture,  of  course,  the  block  is  made  up  of  plots 
of  ground  immediately  adjoining  each  other,  it  having  been  found  that 
plots  close  together  tend  to  have  fertilities  more  comparable  than  plots 
widely  separated. 

(b)  For  experiments  on  the  nutrition  of  animals,  it  has  been  found  that 
animals  from  the  same  litter  resemble  each  other  in  their  response  to  a 
given  diet  more  than  animals  from  other  litters. 

(c)  In  experiments  on  the  purification  of  the  products  of  fermentations,  each 
fermentation  may  be  conceivably  different  from  the  others  in  some  slight 
degree,  so  may  be  considered  as  a  block. 

(d) If, in any experiment in which the personal idiosyncrasies of the operator
may affect the result, the total number of treatments to be compared
involves the use of more than one operator, then clearly the operator is
the block.

(e) There are many other possible sources of inhomogeneity in the
experimental background that may be considered as blocks. For example, in
a comparison of treatments involving use of a thermostat or incubator
it is obviously preferable if possible to place all the units with all the
treatments in the same thermostat or incubator at the same time. Errors
due to lack of perfection in the equipment will then be at an absolute
minimum. It may be, however, that the thermostat is not large enough
to take all the treatments at one time. We then either have to use several
thermostats, in which case the thermostat is obviously the block, or to
use the same thermostat at successive intervals of time, in which case
the time is obviously the block.

(g) In any experiment in which a number of treatments are to be compared
and in which the material on which they are applied may change slightly
with time, and in which only a given number of treatments can be tried
out in one day, then the day is the block. Examples are where there
might be differential leakage from a gas holder in the case of a mixture
of gases, or where a broth, e.g. of penicillin, may be appreciably unstable.

(h) In many cases there are no clear-cut distinctions of the type illustrated
above, but nevertheless advantages will result from splitting the
experiment up into blocks. In general experiments done as close together as
possible in time and space will resemble each other more than those
remote from them. The air temperature, atmospheric pressure, humidity,
adjustment of the machines, mood and behaviour of the operators, and
many other less tangible factors will all tend to vary less within a short
space of time than over a longer period. Accordingly, if the experiment
can be so split up into blocks the effective error will be appreciably
reduced.

(*) This design will not be discussed in detail as the author has not found instances
in industrial work where the comparison of such large numbers of treatments is necessary.
It is described here, however, as it is the simplest of a series of such designs. A general
description with references is by F. Yates : Empire Journal of Experimental Agriculture
VIII, page 223, 1940. Descriptions and methods of computation of several are in C. H.
Goulden : "Methods of Statistical Analysis" (John Wiley) 1939.

(j)  Multiple  Factor  Experiments 

The  type  of  experiment  which  we  have  been  discussing  up  to  now  has  had  only 
one  "independent  variable"  or  "factor."  In  much  industrial  work,  of  course, 
there  are  a  relatively  large  number  of  independent  variables  which  we  wish  to 
investigate,  and  we  therefore  require  a  further  set  of  techniques. 

The  classical  ideal  of  experimentation  is  to  have  all  the  independent  variables 
but  one  constant.  It  is  frequently  not  recognized  that  this  may  sometimes  be 
far  from  the  ideal,  for  in  order  to  get  a  fair  assessment  of  the  effects  of  varying 
the  particular  variable  one  must  allow  the  others  to  vary  over  their  full  range  as 
well.  If  they  were  held  constant,  they  would  have  inevitably  to  be  held  constant 
at completely arbitrary values. Thus varying factor A from its normal value
$A_1$ to some other value $A_2$ may produce a given change in the quality of the
product when factor B is at a value of $B_1$ but a different change in the quality of the
product when factor B is at a value $B_2$. The factorial design of experiment is
designed to detect this type of effect, and at the same time gives maximum
efficiency, i.e. gives the maximum amount of information about the system being
experimented upon for a given amount of work.

The  technique  of  factorial  experimentation  was  developed  in  the  science  of 
agriculture  in  the  twenties  and  thirties  largely  at  the  Rothamsted  Experiment 
Station  by  R.  A.  Fisher  and  his  colleagues. 

Fisher's  approach  to  experimentation  differs  in  two  fundamental  aspects 
from  the  classical  one-variable-at-a-time  ideology.  Firstly,  he  stresses  the 
importance of obtaining an accurate estimate of the magnitude of the error
variation, rather than its minimisation. The accurate estimate of the error variation
is  necessary  in  order  to  apply  an  exact  test  of  significance. 

Secondly,  Fisher  emphasises  the  advantages  to  be  gained  from  including  in 
the  same  experiment  as  many  as  possible  of  the  factors  whose  effects  are  to  be 
determined.     The  advantages  of  this  are — 

(a)  much  greater  efficiency  ;    estimates  of  a  given  standard  of  accuracy  for 
the effects can be obtained in a much smaller total number of observations.
When the experiments are on the full industrial scale this may
represent  a  very  considerable  reduction  in  their  cost ; 

(b)  information  is  given  on  the  extent  to  which  the  factors  interact,  i.e.  the 
way  in  which  the  effect  of  one  factor  is  affected  by  the  other  factors  ;  the 
experiments  will  therefore  give  a  wider  inductive  basis  for  any  conclusions 
that  may  be  reached.  To  give  an  illustration  of  this,  imagine  one  man 
carrying  out  a  comparison  of  two  catalysts  for  a  contact  oleum  plant,  the 
conversion  efficiency  being  the  dependent  variable  of  interest.  He  would 
hold  constant  all  his  other  factors,  e.g.  in  particular  gas  inlet  temperature 
and  rate  of  loading.  He  might  conclude  from  his  experiments  that 
catalyst  A  was  better  than  catalyst  B.  Meanwhile  a  second  experimenter 
could  conclude  from  experiments  on  inlet  gas  temperature  that  420°C. 
was  the  best ;  the  experiments  on  catalyst  may  have  been  made  with  a 
temperature  of  450°C,  and  hence  were  invalid.  The  first  experimenter 
could  point  out  that  the  second  experimenter's  work  was  made  with 
catalyst  B  and  hence  his  conclusions  are  not  valid.  Similarly,  a  third 
set  of  experiments  on  rate  of  loading  might  show  that  the  rates  chosen 
for the first two sets of experiments were incorrect, and hence the
conclusions drawn therefrom are invalid. However, the loading experiments
might have been carried out at the wrong temperature and with
the wrong catalyst, and hence would not be valid. It is possible, of
course, that 420°C. is the best temperature irrespective of which catalyst
and loading rate were employed, and that catalyst A is the best catalyst
irrespective  of  which  temperature  and  loading  rates  were  employed,  and 
that  the  best  rate  of  loading  remains  the  best  irrespective  of  which  catalyst 
and  temperature  are  employed.  To  make  these  assumptions  without 
demonstrating  their  validity  is  quite  unjustified,  however.  This  type  of 
effect,  namely  the  effect  of  one  factor  being  dependent  upon  the  value  of 
another  factor,  is  known  as  an  "interaction." 

(k)  The  Three  Factor  Experiment 

(i)  Classical 

Let  us  consider  a  hypothetical  experiment,  in  which  we  wish  to  investigate 
the effects of three independent variables P, Q and R (which may be temperatures,
pressures,  flow  rates,  concentrations,  etc.)  upon  a  dependent  variable  x  (which 
may  be  yield,  purity,  etc.). 

Suppose  that  in  the  first  instance  it  will  be  adequate  to  investigate  them  each 
at only two levels. Thus suppose the normal values for the process are $P_1$, $Q_1$
and $R_1$, and we wish to find the effects of increasing them to $P_2$, $Q_2$ and $R_2$
respectively. How would we carry out this experiment in the classical style ?

We would first do an experimental control with values $P_1 Q_1 R_1$. A value
of x would be obtained which we denote by $(P_1 Q_1 R_1)_x$.

To obtain the effect on x of changing P from $P_1$ to $P_2$ (which we will
symbolically denote by $(P_1 - P_2)_x$) we do an experiment with values $P_2 Q_1 R_1$. A value
of x will be obtained which we denote by $(P_2 Q_1 R_1)_x$. It is then clear that

$$(P_1 - P_2)_x = (P_1 Q_1 R_1)_x - (P_2 Q_1 R_1)_x$$

Similarly, experiments at $P_1 Q_2 R_1$ and $P_1 Q_1 R_2$ give us

$$(Q_1 - Q_2)_x = (P_1 Q_1 R_1)_x - (P_1 Q_2 R_1)_x$$

and

$$(R_1 - R_2)_x = (P_1 Q_1 R_1)_x - (P_1 Q_1 R_2)_x$$

Now a very important point is that each of these experiments would have
to be repeated at least once, for without at least two observations it
is quite impossible to make any estimate of experimental error. Without such
an estimate it is quite impossible to say whether any apparent difference between,
say, $(P_1 Q_1 R_1)_x$ and $(P_2 Q_1 R_1)_x$ is real or is due to the errors of measurement
and sampling, etc. We therefore require our four experiments to be repeated
once, making eight observations in all. Each level of each effect will be given
by two observations, i.e. we will be comparing the mean of two observations with
the mean of two observations.

No  information  is  given  by  this  experiment  as  to  any  possible  interactions 
between the factors. Thus for example, $(R_1 - R_2)_x$ with P at $P_1$ may well be
different from $(R_1 - R_2)_x$ with P at $P_2$, but our experiment is quite unable to
detect  such  an  effect. 

(ii)  Factorial 

With the factorial design we should carry out experiments at all combinations
of P, Q, and R, namely $(P_1 Q_1 R_1)$, $(P_2 Q_1 R_1)$, $(P_1 Q_2 R_1)$, $(P_2 Q_2 R_1)$,
$(P_1 Q_1 R_2)$, $(P_2 Q_1 R_2)$, $(P_1 Q_2 R_2)$, and $(P_2 Q_2 R_2)$, making a total of
eight observations, the same total as in the classical design.

It  may  make  it  easier  to  grasp  the  significance  of  the  above  set  of  combinations 
if  we  realize  that  they  are  the  co-ordinates  of  eight  corners  of  a  cube  whose  axes 
are  P,  Q,  and  R  (Figure  1). 

Figure 1. (Cube with axes P, Q and R, the eight treatment combinations lying at its corners.)


Having such a picture we can now visualise how we obtain our estimates of
the three main effects. We obtain our estimate of $(P_1 - P_2)_x$ by comparing the
average of the $P_1$ plane with the average of the $P_2$ plane ; in detail, the average
of $(P_1 Q_1 R_1)_x$, $(P_1 Q_2 R_1)_x$, $(P_1 Q_2 R_2)_x$, $(P_1 Q_1 R_2)_x$ with the corresponding set
$(P_2 Q_1 R_1)_x$, $(P_2 Q_2 R_1)_x$, $(P_2 Q_2 R_2)_x$, $(P_2 Q_1 R_2)_x$. Although the other two
variables Q and R are varying, they are varying in such a way that they correspond
in the two sets, and thus to a first approximation their disturbing effects cancel
out. The other main effects $(Q_1 - Q_2)_x$ and $(R_1 - R_2)_x$ are obtained similarly.
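
The comparison of planes can be written out as a short calculation. The sketch below is illustrative only: the yields are invented, the function names are not from the text, and the sign convention follows the definition above (level 1 average minus level 2 average).

```python
from itertools import product
from statistics import mean

def full_factorial(n_factors):
    """All combinations of levels 1 and 2 for n_factors factors."""
    return list(product((1, 2), repeat=n_factors))

def main_effect(results, factor):
    """Mean of x at level 1 minus mean of x at level 2 of the given factor.

    results maps a tuple of levels, e.g. (1, 2, 1) for P1 Q2 R1, to x.
    factor is the position in that tuple (0 for P, 1 for Q, 2 for R).
    """
    level_1 = [x for levels, x in results.items() if levels[factor] == 1]
    level_2 = [x for levels, x in results.items() if levels[factor] == 2]
    return mean(level_1) - mean(level_2)

# Hypothetical yields, invented purely for illustration.
runs = full_factorial(3)
yields = [60, 64, 62, 66, 70, 74, 76, 80]
results = dict(zip(runs, yields))
effects = {name: main_effect(results, i) for i, name in enumerate("PQR")}
```

Each effect is the difference between two means of four observations, so all eight results contribute to every one of the three estimates.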

The first advantage of our factorial design is that our main effects are obtained
as the difference between the mean of one set of four observations and the mean
of another set of four observations. In the classical design our main effects were
the differences between means of two observations. We have thus obtained
double the accuracy for the same number of experiments. If this experiment is
being carried out on the full industrial scale at considerable cost and trouble, this
factor can be of the greatest economic importance.
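
The gain can be made explicit with a short variance calculation. It assumes, which the text does not state in these terms, that every observation carries an independent error of the same variance $\sigma^2$, so that a mean of n observations has variance $\sigma^2/n$.

```latex
\begin{align*}
\text{classical (means of 2):}\quad
  \operatorname{Var}\!\left[(P_1 - P_2)_x\right]
  &= \frac{\sigma^2}{2} + \frac{\sigma^2}{2} = \sigma^2, \\
\text{factorial (means of 4):}\quad
  \operatorname{Var}\!\left[(P_1 - P_2)_x\right]
  &= \frac{\sigma^2}{4} + \frac{\sigma^2}{4} = \frac{\sigma^2}{2}.
\end{align*}
```

On this reading "double the accuracy" means that the variance of each estimated main effect is halved; the standard error falls by a factor of $\sqrt{2}$.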

The  second  advantage  of  the  factorial  design  is  that  it  provides  us  with  an 
estimate  of  the  possible  interactions  between  the  main  effects. 

Consider our cube again. To obtain the interaction between P and R,
average the front plane with the back plane, i.e. we average over Q. This gives
us a table as below, the superscript Q indicating that figures are averaged over Q.

$(P_1 R_1)^Q$    $(P_2 R_1)^Q$

$(P_1 R_2)^Q$    $(P_2 R_2)^Q$

The top line gives us $(P_1 - P_2)_x$ at R = $R_1$ and the bottom line gives us
$(P_1 - P_2)_x$ at R = $R_2$. Since each figure in the table is the average of two results,
we have a valid estimate of error, and can determine whether $(P_1 - P_2)_x^{R = R_1}$, the
superscript indicating that the figure is obtained with R at $R_1$, differs
significantly from $(P_1 - P_2)_x^{R = R_2}$. If they do differ significantly, then there is an
interaction between P and R. The other interactions can be derived similarly.
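
The averaging over Q can be sketched in code. The figures are invented for illustration and the function names are not from the text. The sketch only forms the two-way table and the two estimates of the P effect; whether their difference is significant needs the error estimate treated in Chapter XI (d), which is not attempted here.

```python
from itertools import product
from statistics import mean

# Hypothetical results keyed by (P level, Q level, R level); invented data.
yields = [60, 64, 62, 66, 70, 80, 76, 86]
results = dict(zip(product((1, 2), repeat=3), yields))

def pr_table(results):
    """Average over Q to give the 2 x 2 table indexed by (P level, R level)."""
    return {(p, r): mean(results[(p, q, r)] for q in (1, 2))
            for p, r in product((1, 2), repeat=2)}

def p_effect_at(table, r):
    """The (P1 - P2) effect on x with R held at level r."""
    return table[(1, r)] - table[(2, r)]

table = pr_table(results)
at_r1 = p_effect_at(table, 1)      # top line of the table
at_r2 = p_effect_at(table, 2)      # bottom line of the table
difference = at_r1 - at_r2         # non-zero suggests a P x R interaction
```

With these invented figures the P effect is -12 at the first level of R and -18 at the second, so the two lines of the table disagree by 6.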

The  computation  of  the  three  factor  experiment  is  discussed  in  Chapter 
XI  (d). 


(l) Higher Factorial Experiments

We  have  been  discussing  a  three  factor  experiment,  all  factors  being  at  two 
levels. We can represent this symbolically as 2 × 2 × 2, or $2^3$. The extension
to a fourth factor is obvious, though not capable of a geometrical representation.
If the fourth factor is represented by S, then we can regard the eight experiments
so far mentioned as having been carried out with S at its lower value, $S_1$, and to
introduce the fourth factor S we carry out a similar eight experiments with
S now at its upper level $S_2$. The estimate of the main effect of S is then given
by the difference between the average of the first eight with S = $S_1$ and the average
of the second eight with S = $S_2$. The main effects of the other three factors
are  obtained  similarly.  There  are  now  six  possible  first  order  interactions,  taking 
the  factors  in  pairs,  namely  PQ,  QR,  PR,  PS,  QS,  RS,  and  to  obtain  each  of  these 
we  average  over  the  two  factors  not  mentioned  in  the  interaction,  e.g.  for  PQ  we 
average  over  R  and  S. 

It  will  be  apparent  that  the  introduction  of  the  fourth  factor  has  improved 
the experiment appreciably. The main effects are given as a result of the
comparison of averages of eight results (instead of four) : the first order interactions
are given as a result of the comparison of averages of four results (instead of two).
Further, it is now possible to check on the possible existence of what are known as
second order interactions. As an example of one of these, consider the interaction
PQ : it is possible that its magnitude may depend on the value of R, being different
when R = $R_1$ from what it is when R = $R_2$. Such an effect we call a second
order interaction and denote as PQR or P × Q × R. Actually it is found that
such  interactions  are  symmetrical  with  respect  to  three  factors.  PQR  can  be 
regarded  as  the  interaction  of  PQ  with  R,  QR  with  P,  or  RP  with  Q.  With  four 
factors  there  are  four  possible  second  order  interactions,  PQR,  QRS,  RSP  and 
PQS.  Our  four  factor  experiment  will  estimate  the  existence  of  all  these  possible 
effects  (the  procedure  is  given  in  Chapter  XI  (e)). 
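
The counts of effects given above follow from choosing the factors in ones, twos and threes. A minimal sketch (the function name is invented for illustration):

```python
from itertools import combinations

def effects_by_order(factors):
    """Group every main effect and interaction by the number of factors involved."""
    return {size: ["".join(c) for c in combinations(factors, size)]
            for size in range(1, len(factors) + 1)}

groups = effects_by_order("PQRS")
main_effects = groups[1]     # single factors
first_order = groups[2]      # factors taken in pairs
second_order = groups[3]     # factors taken in threes
total = sum(len(names) for names in groups.values())
```

Four factors give four main effects, six first order and four second order interactions; with the single interaction of all four factors this makes 15 effects in all, one fewer than the 16 runs.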

It  will  be  noted  that  the  four  factor  experiment  is  even  more  efficient  relative 
to  the  classical  design  than  the  three  factor  experiment,  which  achieved  double 
the  accuracy  for  the  main  effects. 

The four factor experiment requires $2^4$ = 16 runs, and its main effects are
given as the comparison of the average of 8 runs with the average of 8. The
nearest equivalent classical experiment is to carry out the five combinations

$P_1 Q_1 R_1 S_1$
$P_2 Q_1 R_1 S_1$
$P_1 Q_2 R_1 S_1$
$P_1 Q_1 R_2 S_1$
$P_1 Q_1 R_1 S_2$

each three times which would require a total of 15 runs. The main effects are
then given as the comparison of averages of 3 runs, which are markedly inferior
to the factorial experiment. The latter of course also checks on the existence of
the six possible first order interactions and the four possible second order
interactions.

It is clear where the immense superiority of the factorial experiment is arising.
In the classical experiment all the 12 observations on the last four combinations
are each only being used once, to contribute information on only one effect. Thus
the three replications of $P_2 Q_1 R_1 S_1$ contribute information only on the main effect
of P and add nothing whatever to our knowledge of the other three factors or to
the various possible interactions. Similarly for $P_1 Q_2 R_1 S_1$, $P_1 Q_1 R_2 S_1$ and
$P_1 Q_1 R_1 S_2$. Only the combination $P_1 Q_1 R_1 S_1$ is used several times. In the
factorial experiment, on the other hand, every observation is used many times
over in a different manner each time.
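
The size of the advantage can be put in figures, again on the assumption (not stated in the text) that every run has an independent error of the same variance $\sigma^2$.

```latex
\begin{align*}
\text{classical, 15 runs (means of 3):}\quad
  \operatorname{Var}[\text{main effect}]
  &= \frac{\sigma^2}{3} + \frac{\sigma^2}{3} = \frac{2\sigma^2}{3}, \\
\text{factorial, 16 runs (means of 8):}\quad
  \operatorname{Var}[\text{main effect}]
  &= \frac{\sigma^2}{8} + \frac{\sigma^2}{8} = \frac{\sigma^2}{4}, \\
\text{ratio of variances:}\quad
  \frac{2\sigma^2/3}{\sigma^2/4} &= \frac{8}{3} \approx 2.7 .
\end{align*}
```

For one extra run the factorial experiment therefore estimates each main effect with rather less than half the variance of the classical experiment.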

The  extension  to  more  than  four  factors  or  to  factors  at  more  than  two  levels 
follows  naturally. 

Finally,  it  will  have  been  noted  that  the  larger  factorial  experiments  require 
a fair number of experiments, e.g. a $2^4$ or 2 × 2 × 2 × 2 experiment requires
$2^4$ = 16 runs. This number may be larger than can be comfortably put into a
block.  Methods  have  been  worked  out  which  evade  these  difficulties  and  enable 
us  to  put  these  16  runs  into  2  blocks  of  8  runs  or  into  4  blocks  of  4  runs.  These 
methods  are  known  as  confounding  and  are  discussed  in  Chapter  XIV. 

(m)  Randomisation 

In  any  experimental  design  it  is  of  the  greatest  importance  that  the  various 
units  of  the  experiment  should  be  randomised.  Sometimes  this  is  randomisation 
in space, as the layout of plots in a field experiment or the positioning of flasks in
an  incubator  or  thermostat.  It  may  be  randomisation  in  time,  as  where  a  series 
of  batch  experiments  are  carried  out  on  a  single  reactor.  Where  there  are  several 
reactors  in  use,  the  question  of  which  experiment  is  done  on  which  reactor  should 
always  be  decided  by  randomisation. 

In  a  simple  unconfounded  factorial  experiment  the  position  of  any  unit  can 
be  randomised  by  giving  each  unit  a  serial  number  and  then  choosing  these  numbers 
randomly.  If  the  experiment  is  confounded,  then  we  should  randomise  the  order 
of  the  blocks  and  then  randomise  the  order  of  units  within  the  block. 
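
Both cases can be sketched as follows. The function names are invented, Python's random module stands in for the random number table, and the split into two blocks is a placeholder: a real confounded design chooses its blocks by the methods of Chapter XIV.

```python
import random
from itertools import product

def randomised_order(units, rng):
    """Unconfounded case: give every unit a random position."""
    return rng.sample(units, len(units))

def randomised_blocks(blocks, rng):
    """Confounded case: shuffle the blocks, then the units inside each block."""
    return [rng.sample(block, len(block))
            for block in rng.sample(blocks, len(blocks))]

rng = random.Random(2024)                  # fixed seed only for repeatability
units = list(product((1, 2), repeat=4))    # the 16 runs of a 2^4 experiment
order = randomised_order(units, rng)

# Placeholder blocks of eight runs each, used only to show the mechanics.
blocks = [units[:8], units[8:]]
plan = randomised_blocks(blocks, rng)
```

Each unit appears exactly once in the randomised order, and in the blocked plan units never leave their own block.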

Where  we  are  using  a  Latin  Square,  the  treatment  should  be  allocated  to 
the letters randomly, and the rows and columns should be allocated to their
respective factors at random. With balanced incomplete blocks the treatments should be
allocated  to  the  letters  randomly,  and  the  blocks  then  arranged  in  a  random  order. 

For  carrying  out  the  operation  of  randomisation  the  simplest  method  is,  if 
we  have  to  randomise  16  units,  to  label  16  cards  from  1  to  16  and  draw  them 
from  a  hat.  It  is  generally  more  convenient  to  use  a  table  of  random  numbers 
for  the  purpose.  Table  VII  in  the  Appendix  is  the  first  third  of  the  table  of 
random  numbers  in  Fisher  and  Yates'  Tables. 

There  are  various  methods  of  using  the  tables.  For  example,  to  select  a 
series  of  numbers  from  the  numbers  1  to  16  in  a  random  order,  we  can  choose 
any  row,  column,  or  diagonal  of  two  figure  numbers  and  read  along  recording 
the  numbers  1  to  16  as  they  occur.  This  is  a  lengthy  process,  however,  and  it  is 
quicker  to  divide  each  two  figure  number  by  16  and  take  the  remainder.  These 
will  run  from  0  to  15,  so  we  need  to  consider  0  as  16,  which  thus  gives  us  the 
numbers from 1 to 16. The two figure random numbers go from 00 to 99, and
this means that the remainders 0, 1, 2 and 3 have an extra chance of occurring
since they can be derived from the two figure numbers 96, 97, 98 and 99 which
form the beginning of the seventh cycle. To avoid this the numbers 96, 97, 98
and 99 should be discarded when they occur. Similarly in choosing numbers
from the numbers 1 to 27, in dividing by 27 we should omit the two figure numbers
81 up to 99.
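
The remainder rule, with its discarding of the incomplete final cycle, can be written as a small function. This is a sketch; the name to_unit and the use of None to mark a discarded number are choices made here for illustration.

```python
def to_unit(two_figure, n):
    """Map a two figure random number (0 to 99) to a unit number 1..n.

    Returns None for numbers in the final incomplete cycle, which are
    discarded so that every remainder stays equally likely.
    """
    if not 0 <= two_figure <= 99:
        raise ValueError("expected a two figure number from 00 to 99")
    usable = (100 // n) * n              # 96 for n = 16, 81 for n = 27
    if two_figure >= usable:
        return None
    remainder = two_figure % n
    return n if remainder == 0 else remainder   # a remainder of 0 counts as n

# How often each unit number arises over all 100 two figure numbers.
counts = {}
for number in range(100):
    unit = to_unit(number, 16)
    if unit is not None:
        counts[unit] = counts.get(unit, 0) + 1
```

With the discards in place each of the numbers 1 to 16 arises from exactly six of the two figure numbers.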

The  arrangement  of  a  set  of  say,  16  units  in  a  random  order  can  be  made 
by  the  process  in  the  previous  paragraph,  rejecting  numbers  after  they  have 
occurred  once.  After  about  half  the  numbers  have  been  selected,  however,  the 
rejections  become  more  frequent  than  the  selections.  Fisher  and  Yates  point  out 
that  this  can  be  avoided  by  making  the  first  selection  from  1  to  16,  which  leaves 
15  treatments.  We  can  now  select  a  number  from  1  to  15,  and  that  will  leave 
14  treatments,  and  so  on. 
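
The shrinking selection can be sketched as below. The function name is invented, and the random number generator stands in for a table of random numbers.

```python
import random

def random_arrangement(n, rng):
    """Arrange units 1..n in random order without any rejections.

    Each step draws a position among the units still unallocated,
    so the range of the draw shrinks by one each time.
    """
    remaining = list(range(1, n + 1))
    order = []
    while remaining:
        pick = rng.randint(1, len(remaining))    # 1 to 16, then 1 to 15, ...
        order.append(remaining.pop(pick - 1))
    return order

order = random_arrangement(16, random.Random(7))  # seed only for repeatability
```

Every draw yields a unit, so exactly 16 draws are needed however the numbers fall.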

Randomisation  in  this  systematic  manner  may  sound  a  lengthy  procedure, 
but  actually  it  can  be  carried  out  in  a  minute  or  two  and  since  it  is  essential  to 
the obtaining of valid results it should never be omitted.


CHAPTER  II 
FUNDAMENTAL  STATISTICAL  CONCEPTIONS 

(a)  Statistical  Terminology 

Statistics,  like  every  other  science,  has  its  own  words  and  symbols  of  special 
meaning.  These  may  be  confusing  to  the  newcomer,  but  are  almost  essential 
for  the  normal  use  of  its  methods. 

We  apply  our  statistical  methods  to  sets  of  numbers  obtained  by  making 
measurements  upon  some  property  of  our  product  or  process.  The  property 
might  be  yield,  purity,  or  such  qualities  as  the  life  in  days  or  in  tons  throughput 
of  a  piece  of  plant  before  failure.  Or  a  machine  may  be  making  articles  whose 
size,  weight,  or  metallurgical  hardness  we  measure.  This  variable  we  refer  to 
as  the  "dependent  variable,"  and  we  are  frequently  interested  in  the  effect  upon 
it  of  the  conditions  under  which  it  is  obtained.  These  conditions  can  generally 
be  specified  in  terms  of  a  number  of  "independent  variables,"  which  we  can 
refer  to  as  "factors."  Thus  in  many  chemical  processes  the  conditions  can  be 
determined  by  specifying  values  for  the  temperature,  pressure,  time  of  reaction, 
proportions  of  reactants.  A  given  value  for  an  independent  variable  is  referred 
to  as  a  "level."  A  given  observation  on  a  dependent  variable  is  referred  to  as  an 
"individual." 

An  important  concept  is  that  of  "population,"  broadly  speaking  a  large 
number  of  individuals  from  a  particular  source.  We  frequently  wish  to  estimate 
the  mean  value  of  a  property  of  a  whole  set  of  individuals  from  a  small  number 
of  them  :   this  part  withdrawn  from  the  whole  we  refer  to  as  a  "sample." 

The use of a bar over a symbol indicates that the average is meant : thus
$\bar{x}$ is the average of the set of x's considered. $\Sigma$ indicates summation, i.e. addition,
of the series of terms considered.
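
Put together, the two symbols give the usual expression for the average. The index i and the number of observations n are introduced here for illustration.

```latex
\[
  \bar{x} = \frac{1}{n} \sum_{i=1}^{n} x_i
          = \frac{x_1 + x_2 + \cdots + x_n}{n}
\]
```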

An  awkward  conception  is  "degrees  of  freedom."  It  is  closely  related  to  the 
number  of  observations,  in  general  the  number  of  observations  minus  the  number 
of constraints imposed on the system. An accurate definition is difficult,
however, and we will be content with giving an explicit account for each particular
application.  It  is  interesting  to  note  that  rather  similar  conceptions  will  be 
familiar  to  physical  chemists  in  the  application  of  the  phase  rule  and  in  the 
quantum  mechanics  of  molecules. 

(b)  Probability 

Probability  is  expressed  on  the  scale  0  to  1,  the  former  implying  that  an 
event  will  certainly  not  occur  (or  that  a  hypothesis  is  not  true)  and  the  latter 
implying that the event is certain to occur (or that the hypothesis is definitely
true).  On  this  scale  a  value  of  0.5  implies  that  the  event  is  as  likely  to  occur 
as  not  (or  that  the  hypothesis  is  as  likely  to  be  true  as  not). 

Frequently  in  practice  this  scale  of  fractions  is  transformed  into  one  of 
percentages,  e.g.  a  probability  of  0.05  is  referred  to  as  a  5%  probability. 

(c)  Populations  :    Tests  of  Significance 

"The  idea  of  an  infinite  population  distributed  in  a  frequency  distribution 
in  respect  of  one  or  more  characteristics  is  fundamental  to  all  statistical  work." 
From  a  limited  experience  of  some  function  or  of  individuals  of  a  given  type 
"we  may  obtain  some  idea  of  the  infinite  hypothetical  population  from  which 
our  sample  is  drawn,  and  so  of  the  probable  nature  of  future  samples  to  which 
our  conclusions  are  to  be  applied.  If  a  second  sample  belies  this  expectation  we 
infer  that  it  is,  in  the  language  of  statistics,  drawn  from  a  different  population  ; 
that  the  treatment  to  which  the  second  sample"  has  been  subjected  "did  make  a 
material difference . . . Critical tests of this kind may be called tests of
significance, and when such tests are available we may discover whether a second
sample  is  or  is  not  significantly  different  from  the  first"  (Fisher,  "Statistical 
Methods  for  Research  Workers"). 

The most common type of test has the form in which we assume that the
means (Student's t test), the variances (Fisher's z test), or the fit of the assumed
distribution to the actual data (use of $\chi^2$) are the same for the two samples and
calculate the probability of getting such results as we did get on the basis of this
hypothesis being true. So long as the probability is greater than 0.05 (5%) there
is no strong reason to doubt the hypothesis. When the probability is 0.05,
however, a value as small as this would only be obtained once in 20 times if there
were  really  no  difference,  and  hence  it  is  generally  taken  that  this  indicates  that 
the  hypothesis  is  not  true.  Smaller  values  of  the  probability  are,  of  course,  an 
even  stronger  indication  that  the  hypothesis  is  incorrect.  Thus  a  small  probability 
corresponds  to  a  high  degree  of  significance. 

(d)  Significance  Levels 

It  is  entirely  a  matter  of  judgment  what  probability  level  we  take  as  being 
significant.  For  many  purposes  the  5%  level  is  accepted,  but  we  must  realise 
that  this  means  that  if  we  tested  a  long  series  of  possible  effects,  none  of  which 
actually  existed,  in  1  in  every  20  tests  we  would  incorrectly  assert  that  the  effect 
did exist. If we adopt a higher (that is, more stringent) significance level, say 1%, then we reduce the
frequency  with  which  we  make  this  type  of  error  but  we  increase  the  frequency 
with  which  we  ignore  effects  which  really  do  exist.  The  best  practical  compromise 
is  probably  to  regard  results  at  or  even  just  under  the  5%  level  of  significance  as 
well  worth  following  up.  For  presentation  of  final  results  and  conclusions, 
however,  one  would  prefer  a  higher  level.  This  can  be  obtained,  if  the  effect 
really  does  exist,  by  obtaining  more  observations. 
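
The "1 in every 20 tests" statement above can be put into numbers with a short Python sketch. The function name and the choice of 20 tests are illustrative assumptions; the second figure also assumes that the tests are independent of one another, which the text does not state.

```python
def false_positive_summary(num_tests, alpha):
    """For tests of effects that do not exist, return the expected number of
    false assertions and the chance of at least one (independence assumed)."""
    expected_false = num_tests * alpha
    prob_at_least_one = 1.0 - (1.0 - alpha) ** num_tests
    return expected_false, prob_at_least_one


# Twenty tests of non-existent effects at the 5% level.
expected_false, prob_at_least_one = false_positive_summary(20, 0.05)
```

On these assumptions one false assertion is expected in the twenty tests, and the chance of making at least one is about 0.64.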

Throughout  this  book  we  shall  use  the  word  "significant"  in  the  sense 
"statistically  significant."  "Significant"  accordingly  will  mean  merely  that  the 
data  in  question  is  sufficient  to  establish  that  such  and  such  an  effect  exists  with 
the  stated  level  of  significance  :  it  is  not  meant  to  imply  that  the  effect  is  necessarily 
of  great  importance  from  the  practical  point  of  view,  though  of  course  it  might 
be.  The  converse  also  holds  :  an  effect  may  appear  on  a  given  set  of  data  to  be 
non-significant, but this may be merely because the data is inadequate to show it
and  the  effect  may  be  of  the  greatest  practical  importance.  Our  aim  generally 
is  to  give  an  estimate  of  each  effect  together  with  the  so-called  fiducial  or  confidence 
limits  within  which  it  probably  (with  a  stated  level  of  probability)  lies,  though  in 
some  cases  the  solution  of  this  problem  is  by  no  means  simple. 

For  example,  if  an  effect  has  to  be  of  magnitude  15%  to  be  of  practical 
importance,  and  our  data  gives  us  as  our  estimate  of  it  5%  ±  2%,  it  is  clear  that 
the  effect  is  statistically  significant  but  not  of  practical  importance.     Alternatively 
if our result was 5% ± 10%, then it is not statistically significant on our present
data  but  may  still  be  of  practical  importance. 
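
The distinction drawn in this example can be sketched as a small Python function. It rests on an assumption that the text does not spell out: the "±" figure is read as the half-width of the limits within which the effect probably lies. The function and argument names are hypothetical.

```python
def assess_effect(estimate, half_width, practical_threshold):
    """Return (statistically_significant, may_be_practically_important)."""
    lower = estimate - half_width
    upper = estimate + half_width
    # Significant if the limits exclude zero.
    statistically_significant = lower > 0 or upper < 0
    # Possibly important if the limits reach the practical threshold.
    may_be_important = max(abs(lower), abs(upper)) >= practical_threshold
    return statistically_significant, may_be_important


first_case = assess_effect(5.0, 2.0, 15.0)    # 5% +/- 2%
second_case = assess_effect(5.0, 10.0, 15.0)  # 5% +/- 10%
```

The first case comes out significant but not of practical importance, the second non-significant but possibly important, as in the text.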

(e)  Computation 

The  application  of  statistical  methods  generally  involves  a  certain  amount  of 
computation.  A  large  part  of  this  is  the  summing  of  numbers  and  of  their  squares  ; 
this  can  easily  be  performed  with  a  table  of  squares  ("Barlow's  Tables  of  Squares, 
etc."  (E.  and  F.  M.  Spon)  is  a  very  complete  set),  and  with  a  simple  adding  machine. 
For  much  correlation  work  a  calculating  machine  (one  that  will  multiply  and 
divide)  is  almost  essential.  There  are  a  variety  of  models  available  ;  it  can  be 
safely  said  that  the  better  the  machine  the  more  accurately  and  rapidly  can  the 
work  be  carried  out.  An  electric  machine  with  automatic  division  and  multi- 
plication is  really  desirable. 

With  these  machines  it  is  remarkable  with  what  speed  an  apparently  long 
and  tedious  calculation  can  be  carried  through.  Certain  of  them  allow  the  square 
of  a  number  to  be  obtained  with  the  number  being  entered  only  once  on  to  the 
keyboard.  In  forming  the  sum  of  the  squares  of  a  series  of  numbers  the  sum 
of  the  numbers  themselves  can  be  obtained  simultaneously  :  if  this  checks  with 
the  sum  of  the  numbers  obtained  independently,  as  can  often  be  arranged  without 
extra  work,  then  there  is  strong  presumption  that  the  correct  numbers  have  been 
entered  on  to  the  keyboard  and  hence  the  sum  of  squares  is  correct.  Further, 
in  calculating  correlation  coefficients  (see  Chapter  IX)  we  have  a  set  of  pairs  of 
values of x and y, and we require $\Sigma x$, $\Sigma y$, $\Sigma x^2$, $\Sigma y^2$ and $\Sigma xy$. These five sums
can  be  obtained  simultaneously  with  entering  each  pair  of  values  of  x  and  y  only 
once  on  to  the  keyboard,  in  the  case  where  x  and  y  are  each  two  figure  numbers.  (*) 
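
The single entry of each pair described above has a direct counterpart in a program: all five sums can be accumulated in one pass over the data. The following Python sketch is illustrative only; the function name and the three sample pairs are assumptions, not data from the text.

```python
def five_sums(pairs):
    """Accumulate sum x, sum y, sum x^2, sum y^2 and sum xy in a single pass."""
    sum_x = sum_y = sum_xx = sum_yy = sum_xy = 0
    for x, y in pairs:
        sum_x += x
        sum_y += y
        sum_xx += x * x
        sum_yy += y * y
        sum_xy += x * y
    return sum_x, sum_y, sum_xx, sum_yy, sum_xy


# Hypothetical two-figure observations, entered once each.
sums = five_sums([(12, 15), (10, 11), (14, 18)])
```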

(f)  Measures  of  Variability 

In  statistical  work  we  are  largely  concerned  with  the  variability  in  determina- 
tion of  some  property  of  a  system,  e.g.  the  variability  in  yield  or  in  quality  of 
some  chemical  process. 

The  variability  of  a  set  of  observations  can  be  assessed  in  a  variety  of  ways  : — 

(a)  The  "range,"  i.e.  the  difference  between  the  largest  and  the  smallest  of 
the  set,  generally  denoted  by  the  symbol  w. 

(b)  The  "mean  deviation,"  i.e.  the  mean  of  the  deviations  of  the  observations 
from  their  mean. 

(c) The "variance," i.e. the sum of the squares of the deviations of the
observations from their mean, divided by one less than the total number
of observations, generally denoted by the symbol $\sigma^2$.

Thus  for  the  set  of  observations 

9,  11,  14,  11,  13,  12,  10,  13,  12,  15 

(a) the range would be 15 - 9 = 6.

(b) the mean deviation would be obtained by calculating the mean as
(9 + 11 + 14 + ... + 12 + 15)/10 = 120/10 = 12.0, the deviations (taken
without regard to sign) as (12 - 9) = 3, (12 - 11) = 1, (14 - 12) = 2, etc.,
and taking the mean of these deviations as (3 + 1 + 2 + ...)/10 = 14/10 = 1.4.

(c) the variance, denoted by $\sigma^2$, would be defined as

$$\sigma^2 = \frac{\Sigma(x - \bar{x})^2}{n - 1}$$

where $\bar{x}$ is the mean of the x's, n is the number of observations, $(x - \bar{x})^2$
represents the square of the deviation of an x from the mean, and the
summation sign $\Sigma$ indicates that $(x - \bar{x})^2$ is to be obtained for all the
x's and summed for them. Thus the variance of the set of numbers
quoted above would be (the mean $\bar{x}$ is here equal to 12.0)

$$\sigma^2 = \frac{(12 - 9)^2 + (12 - 11)^2 + (14 - 12)^2 + \ldots}{10 - 1} = \frac{30}{9} = 3.33$$

(*) A description of advanced computational procedures is by H. O. Hartley, Supplement
to the Journal of the Royal Statistical Society, Vol. VIII, No. 2, p. 154, 1946.
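
The three measures of variability worked out above for the ten observations can be checked with a few lines of Python. This is a sketch for checking the arithmetic; the variable names are illustrative.

```python
observations = [9, 11, 14, 11, 13, 12, 10, 13, 12, 15]

n = len(observations)
mean = sum(observations) / n

# (a) Range: largest minus smallest observation.
value_range = max(observations) - min(observations)

# (b) Mean deviation: mean of the deviations taken without regard to sign.
mean_deviation = sum(abs(x - mean) for x in observations) / n

# (c) Variance: sum of squared deviations divided by (n - 1).
variance = sum((x - mean) ** 2 for x in observations) / (n - 1)
```

The values agree with those in the text: a range of 6, a mean deviation of 1.4 and a variance of 30/9 = 3.33.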

Of  these  three  measures  of  variability,  the  range  is  much  used  for  small 
samples  (n  not  greater  than  10)  in  process  control  work  (the  quality  control  chart, 
discussed  in  Chapter  VIII)  on  account  of  its  arithmetical  simplicity.  It  is  clear 
that  it  does  not  utilise  the  whole  of  the  information  from  the  data,  for  the  detail 
of  the  intermediate  results  does  not  enter  into  the  determination. 

The  mean  deviation  has  the  virtue  of  arithmetical  simplicity,  but  its  properties  are 
such  that  it  is  difficult  to  employ  mathematically,  and  it  must  be  considered  obsolete. 

The variance is generally greatly to be preferred as a measure of variability,
for its properties are well known and tests of significance based upon it are
available. It also has the valuable property of being additive, i.e. if a process is subject
to a number of separate variances $\sigma_1^2, \sigma_2^2, \ldots$ due to several independent causes,
then the total variance $\sigma_T^2$ is numerically equal to their sum, i.e.
$\sigma_T^2 = \sigma_1^2 + \sigma_2^2 + \sigma_3^2 + \ldots$ The converse of this is of particular importance, where
we analyse the total variance of a system into components attributable to various
parts  of  the  process.  This  is  discussed  at  length  in  Chapter  VII.  A  further 
advantage  of  the  variance  as  a  measure  of  variability  is  that  it  has  the  maximum 
efficiency,  i.e.  it  extracts  more  information  about  the  variability  from  the  data 
than  the  other  measures. 
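
The additive property can be illustrated with a short Python sketch. The three component variances are hypothetical figures chosen for easy arithmetic, and the causes are assumed to be independent, as the text requires.

```python
import math


def total_variance(component_variances):
    """Total variance of a process subject to independent causes."""
    return sum(component_variances)


# Hypothetical component variances from three independent causes.
components = [4.0, 9.0, 3.0]
variance_total = total_variance(components)
sd_total = math.sqrt(variance_total)

# The standard deviations themselves do not add.
sum_of_sds = sum(math.sqrt(v) for v in components)
```

Here the total variance is 16 and the total standard deviation 4, whereas the separate standard deviations sum to about 6.7: it is the variances, not the standard deviations, that add.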

The square root of the variance is termed the "standard deviation" and
denoted by the symbol $\sigma$.

The  standard  deviation  of  an  average  is  often  referred  to  as  its  standard  error. 

A  term  sometimes  met  in  the  older  literature  but  falling  into  disuse  is 
"probable  error."  This  is  the  limit  such  that  the  probability  that  a  deviation 
exceeds  it  is  50%.  In  the  case  of  a  Gaussian  distribution  (see  Section  (i))  the 
probable  error  is  approximately  0.6745  times  the  standard  error. 
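
The factor 0.6745 can be recovered from the Gaussian distribution itself. The sketch below uses `NormalDist` from the Python standard library (Python 3.8 or later); it is offered as a check, not as the method by which the figure was originally tabulated.

```python
from statistics import NormalDist

standard_normal = NormalDist()  # mean 0, standard deviation 1

# A deviation exceeds the probable error (in either direction) half the time,
# so the probable error is the 75% point of the standard Gaussian curve.
probable_error_factor = standard_normal.inv_cdf(0.75)

# Fraction of deviations lying beyond the probable error on either side.
fraction_beyond = 2.0 * (1.0 - standard_normal.cdf(probable_error_factor))
```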

The  relation  between  range  and  standard  deviation  is  discussed  in  Chapter 
VIII  (e). 

(g)  The  Calculation  of  Variances 

It is often troublesome to calculate the mean $\bar{x}$ and then find the deviation
of each observation x from it before squaring and summing these deviations.
We therefore use the algebraically equivalent formula

$$\sigma^2 = \left[\Sigma(x^2) - \frac{(\Sigma x)^2}{n}\right] / (n - 1)$$

For the above set of numbers

$$\sigma^2 = [9^2 + 11^2 + 14^2 + \ldots + 15^2 - (9 + 11 + 14 + \ldots + 15)^2/10]/(10 - 1)$$

$$= (1470 - 120^2/10)/9 = (1470 - 1440)/9 = 30/9 = 3.33$$

To  lighten  the  arithmetic  in  calculating  variances,  it  is  frequently  worth 
while  to  take  an  arbitrary  zero  and  transform  all  the  numbers  on  to  the  new  scale. 
Thus  with  the  above  set  of  numbers  we  might  shift  the  axis  by  9  units  so  that 
they  now  become 

0,  2,  5,  2,  4,  3,  1,  4,  3,  6. 
The variance of these observations is then

$$\sigma^2 = [0^2 + 2^2 + 5^2 + \ldots + 6^2 - (0 + 2 + 5 + \ldots + 6)^2/10]/(10 - 1)$$

$$= (120 - 30^2/10)/9 = (120 - 90)/9 = 30/9 = 3.33$$

as  before.    Thus  shifting  the  zero  has  not  affected  the  estimate  of  the  variance. 
It should be clear that this should be so: we are measuring the variability about the
mean,  and  even  if  we  added  100  to  all  the  observations  the  variability  remains 
the  same. 
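
The working formula and the effect of shifting the zero can be checked with the following Python sketch. The function name and its `origin` argument are illustrative assumptions.

```python
def variance_by_sums(values, origin=0):
    """Variance from the sum of squares and the squared sum, after
    subtracting an arbitrary origin from every observation."""
    shifted = [v - origin for v in values]
    n = len(shifted)
    sum_of_squares = sum(v * v for v in shifted)
    total = sum(shifted)
    return (sum_of_squares - total ** 2 / n) / (n - 1)


observations = [9, 11, 14, 11, 13, 12, 10, 13, 12, 15]

variance_unshifted = variance_by_sums(observations)         # uses 1470 and 120
variance_shifted = variance_by_sums(observations, origin=9)  # uses 120 and 30
```

Both calls give 30/9 = 3.33, confirming that the choice of origin does not affect the estimate.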

If  the  numbers  had  all  been  100  larger,  then  the  sum  of  squares  would  have 
been  of  the  order  of  100,000,  i.e.  the  arithmetic  would  have  been  rather  unpleasant, 
and  would  have  been  very  much  lightened  by  shifting  the  zero  by  100  units, 
giving  us  our  first  calculation  in  which  the  sum  of  squares  was  1470,  or  by  109 
units,  giving  us  our  second  calculation  in  which  the  sum  of  squares  was  only  120. 

The  business  of  shifting  the  origin  is  largely  a  matter  of  personal  taste.  It 
has  to  be  done  carefully,  but  when  done  the  subsequent  arithmetic  is  much  lighter. 
It  has  two  advantages.  Firstly,  one  will  then  be  working  with  numbers  less  than 
100,  and  with  a  little  practice  it  is  easy  to  carry  the  squares  of  these  numbers  in 
one's  head.  Secondly,  the  squares  of  two  figure  numbers  cannot  have  more  than 
four  figures,  and  since  one  has  only  four  fingers  on  one's  hand  this  makes  entering 
them  on  an  adding  machine  easier. 

If  one  is  using  a  calculating  machine,  however,  particularly  one  with  automatic 
multiplication,  the  shifting  of  the  origin  is  rarely  worth  while. 

(h)  The  Definition  of  Variance 

The definition of variance is formally not $\Sigma(x - \bar{x})^2/(n - 1)$, as quoted
above, but $\Sigma(x - \bar{x})^2/n$, and it is quoted as the latter in many statistical texts.
In any practical application, however, we should use the formula with (n - 1)
as the denominator.

The  reason  for  this  apparent  confusion  is  as  follows : — 

In calculating $\Sigma(x - \bar{x})^2$ we should be calculating the sum of squares of the
deviations from the true mean of the population from which the sample is drawn.
We have only an estimate of the mean, the mean of our sample, which will not
in general be identical with the true mean of the whole population. The sum
of squares of the deviations from the mean of the sample will in general be smaller
than the sum of the squares of the deviations from the true mean. To see this, suppose
we have two observations 1 and 3, and that the true population mean is actually 1.
The sum of the squares of the deviations from the true mean is $(1 - 1)^2 + (3 - 1)^2 = 4$:
working from the true mean we would use n as the divisor to get the
estimate of the variance as 4/2 = 2. The sum of the squares of the deviations from
the sample mean is $(2 - 1)^2 + (3 - 2)^2 = 2$: if we used n as the divisor the
estimate of the variance is 2/2 = 1, whereas if we use (n - 1) as the divisor we
get 2/(2 - 1) = 2 as our estimate of the variance. It is apparent that the use
of (n - 1) instead of n as the divisor helps to compensate for the use of the sample
mean instead of the true mean.

The  number  (n  —  1)  is  known  as  the  degrees  of  freedom  of  the  variance, 
and  is  used  in  making  tests  of  significance  in  comparing  one  variance  with  another 
(Chapter  IV). 
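
The compensation provided by the divisor (n - 1) can be seen by enumerating every possible sample from a small population. The Python sketch below is an illustration under stated assumptions: the population of five values is hypothetical, and samples of two are drawn with replacement so that each draw is independent.

```python
from itertools import product
from statistics import pvariance

population = [1, 2, 3, 4, 5]            # hypothetical population
true_variance = pvariance(population)   # divisor N about the true mean


def squares_about_sample_mean(sample):
    """Sum of squared deviations from the sample's own mean."""
    m = sum(sample) / len(sample)
    return sum((v - m) ** 2 for v in sample)


n = 2
samples = list(product(population, repeat=n))  # all 25 ordered samples

# Average estimate of the variance over all samples, for each divisor.
average_with_n = sum(squares_about_sample_mean(s) / n for s in samples) / len(samples)
average_with_n_minus_1 = sum(
    squares_about_sample_mean(s) / (n - 1) for s in samples
) / len(samples)
```

Averaged over all samples, the divisor (n - 1) reproduces the true variance of 2, whereas the divisor n gives only 1.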

(i)  Distributions 

It  is  a  matter  of  common  experience  that  we  cannot  manufacture  articles 
identically; successive individuals differ slightly in their size, quality, etc.
Similarly,  if  we  make  repeat  determinations  of  some  physico-chemical  quantity, 
e.g.  the  yield  from  some  process,  the  values  we  obtain  will  not  be  exactly  similar  : 
this  variability  would  be  due  not  only  to  the  (possible)  variability  of  the  process, 
but  also  to  the  errors  of  measurement. 

It  can  be  shown  theoretically  that  if  we  assume  that  there  is  a  very  large 
number  of  very  small  errors  all  operating  independently,  then  we  get  a  distribution 
of  the  form  shown  in  figure  2,  known  as  the  Gaussian  distribution. 

This  Gaussian  distribution  has  the  following  valuable  properties  : — 

(a)  it  is  completely  defined  by  two  statistical  parameters,  (i)  the  mean  (fixing 
its  position  along  the  x-axis)  and  (ii)  the  standard  deviation  (fixing  its 
width  or  spread). 
(b) There is a relation, for which tables are already calculated, between the
deviation from the mean, expressed as a multiple of $\sigma$, and the frequency
with which these deviations occur. Thus to particularise, 95% of the
individuals should lie between $-1.96\sigma$ and $+1.96\sigma$ on each side of
the mean, and 99.8% between $-3.09\sigma$ and $+3.09\sigma$ on each side of
the mean. Similarly, if we get a deviation from the mean of d, we
calculate the quantity known as Student's t defined by $d/\sigma$, and tables are
available to show how often a deviation of this size should occur.
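
The two percentages quoted in (b) can be reproduced from the Gaussian distribution with the Python standard library (`statistics.NormalDist`, Python 3.8 or later). The function name is an illustrative assumption.

```python
from statistics import NormalDist


def fraction_within(multiple_of_sigma):
    """Fraction of a Gaussian population lying within the given multiple
    of the standard deviation on either side of the mean."""
    standard_normal = NormalDist()
    return standard_normal.cdf(multiple_of_sigma) - standard_normal.cdf(-multiple_of_sigma)


within_1_96 = fraction_within(1.96)
within_3_09 = fraction_within(3.09)
```

The two fractions come out at 0.950 and 0.998 to three decimal places.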

Figure 2. The Gaussian distribution.


Figure  3  shows  a  typical  distribution  occurring  in  the  chemical  industry.  It 
refers  to  the  quantity  of  one  ingredient  in  a  four  component  mixture.  The 
height  of  the  y-axis  at  any  interval  of  the  x-axis  indicates  the  number  of  batches 
which  had  their  content  of  this  ingredient  in  that  interval.  It  will  be  seen  that 
though  the  experimental  distribution  does  not  follow  exactly  the  theoretical 
Gaussian  curve,  yet  its  features  are  very  similar.  It  must  be  remembered  that 
we  have  here  only  a  total  of  138  observations :  in  any  given  interval  the  number 
to  be  expected  is  quite  small,  therefore,  and  we  must  expect  random  fluctuations 
about  the  expected  value.  As  the  total  number  of  observations  increases  the 
observed  distribution  will  approximate  closer  and  closer  to  the  true  population 
distribution.  In  practice  few  of  the  distributions  found  industrially  are  exactly 
Gaussian,  but  the  departures  from  it  are  generally  slight  and  usually  not  less 
than  about  500  observations  are  necessary  to  establish  significance  for  departures 
from  normality. 
Figure 3.

Most of the tests of significance that we make are based on the assumption
that our variable is distributed in the Gaussian form. It is fortunate that the
departure from it has to be severe before our tests of significance on averages
become appreciably inaccurate. The discrepancies for any distribution with a
single hump somewhere near the middle and tailing off to zero on either side are
quite negligible for practical purposes.

(j)  Grouped  Frequency  Distributions 

If  the  number  of  observations  whose  standard  deviation  is  to  be  calculated 
is  large,  say  40  or  more,  the  labour  involved  can  be  reduced  by  using  what  is 
called  a  grouped  frequency  distribution. 

We  will  use  the  data  of  the  previous  section  as  an  illustration  (see  Table  2.1). 

We divide the variable (here percentage of the component) into about 10 to
20 equal sized groups (column headed X). We count up the number of
observations occurring in each of these groups (column headed f). This column f is the
vertical axis in figure 3. We select a new origin somewhere near the middle of
the range, and lay out a new scale as shown in the column headed x. The
remainder of the arithmetic should be clear. We obtain $\sigma_x$ in our x units. Since
1 x-unit equals 0.1 X-unit, $\sigma_X = 0.182$. Since $\bar{x} = 0.59$ this means that the
mean is 0.59 x-units up from the x-origin, or 0.59/10 = 0.06 X-units up from the X-origin.
The X-origin was at 2.20, hence the mean is at 2.20 + 0.06 = 2.26 X-units.

$$\sigma_x^2 = [502 - 82^2/138]/(138 - 1) = (502 - 48.72)/137 = 453.28/137 = 3.31$$

$$\sigma_x = 1.82$$

$$\sigma_X = 0.182$$

$$\bar{x} = 82/138 = 0.59$$

$$\bar{X} = 2.20 + 0.59/10 = 2.26$$

In our discussion of the properties of the Gaussian distribution it was remarked
that 95% of the total number of observations would lie within $\pm 1.96\sigma$ of the
mean. These limits here are at 2.26 - 1.96 × 0.18 = 1.91 and at 2.26 +
1.96 × 0.18 = 2.61. Inspecting our table we find that those in the two end
intervals are 1 and 3, making a total of 4 out of 138, or 2.9%. However, the
1.85-1.94 and 2.55-2.64 intervals probably contain some outside the limits,
i.e. less than 1.91 or greater than 2.61, so the actual agreement is reasonable.
Similarly, we would expect very few (of the order of 2 out of 1000) observations
outside the limits 2.26 - 3.09 × 0.18 = 1.70 and 2.26 + 3.09 × 0.18 = 2.82.
Inspecting our table we find none of our 138 outside these limits, in good accord
with this prediction based on the Gaussian distribution.
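
The grouped calculation can be followed step by step in Python, using the frequencies of Table 2.1. This is a sketch of the arithmetic only; the variable names are illustrative, and the origin (2.20) and group width (0.1) are those stated in the text.

```python
import math

# Frequencies f for the ten groups of Table 2.1, lowest group first,
# and the working scale x with its origin at the group 2.15-2.24.
frequencies = [1, 6, 8, 24, 26, 32, 23, 9, 6, 3]
working_scale = list(range(-4, 6))
origin_X = 2.20     # position of x = 0 on the original scale
group_width = 0.1   # one x-unit expressed in X-units

n = sum(frequencies)
sum_fx = sum(f * x for f, x in zip(frequencies, working_scale))
sum_fx2 = sum(f * x * x for f, x in zip(frequencies, working_scale))

variance_x = (sum_fx2 - sum_fx ** 2 / n) / (n - 1)
sd_X = math.sqrt(variance_x) * group_width    # back on the original scale
mean_X = origin_X + (sum_fx / n) * group_width
```

The sums come out as 138, 82 and 502, and the mean and standard deviation on the original scale agree with the 2.26 and 0.182 of the text to the figures quoted.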

Table 2.1

    X            f      x      fx     fx^2

1.75-1.84        1     -4      -4      16
1.85-1.94        6     -3     -18      54
1.95-2.04        8     -2     -16      32
2.05-2.14       24     -1     -24      24
2.15-2.24       26      0       0       0
2.25-2.34       32      1      32      32
2.35-2.44       23      2      46      92
2.45-2.54        9      3      27      81
2.55-2.64        6      4      24      96
2.65-2.74        3      5      15      75

Totals         138           +144     502
                              -62
                              +82

It  is  often  worth  while  making  a  rough  comparison  of  this  type  in  order  to 
check  that  one's  calculations  of  the  variance  and  standard  deviation  are  not  grossly 
in  error.  The  check  is  sufficiently  sensitive  to  detect  the  two  commonest  errors, 
a  misplaced  decimal  point  and  the  reading  of  the  wrong  square  root  from  the  tables. 


(k)  Lognormal  Distributions 

For  certain  types  of  data  there  are  sometimes  good  reasons  for  considering 
the  use  of  the  logarithm  of  the  variable  rather  than  the  variable  itself.  Thus 
in  some  cases  the  larger  the  value  of  the  variable  the  more  erratic  will  it  become, 
e.g.  in  corrosion  experiments  the  larger  the  corrosion  the  less  the  agreement  there 
is  between  duplicates.  Under  these  circumstances  we  would  expect  a  skewed 
distribution  with  a  long  tail  to  the  upper  end.  However,  any  deviation  which 
is  proportional  to  the  mean  of  the  unlogged  data  when  expressed  logarithmically 
becomes  constant.  Figure  4  gives  the  distribution  of  throughputs  before  failure 
in  thousands  of  tons  of  a  piece  of  acid  plant.  Figure  5  is  the  same  data  grouped 
according  to  the  logarithm  of  the  throughput.  One  individual  tends  to  give  the 
impression  of  a  tail  at  the  lower  end,  but  taking  an  overall  view  it  is  doubtful  if 
the  logarithmic  distribution  departs  significantly  from  normality. 

When  we  have  a  variable  which  clearly  has  a  lognormal  distribution  and  we 
wish  to  make  a  test  of  significance,  e.g.  compare  two  means,  then  it  is  probably 
advisable  to  carry  out  the  test  on  the  logarithms  of  the  variable.  An  example  of 
this  is  given  in  Chapter  XII  (k). 
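
The remark that a deviation proportional to the mean becomes constant when expressed logarithmically can be illustrated numerically. The duplicate pairs in this Python sketch are hypothetical; each second value is taken to be 10% above the first.

```python
import math

# Hypothetical duplicate determinations at three very different levels.
duplicates = [(10.0, 11.0), (100.0, 110.0), (1000.0, 1100.0)]

# On the original scale the disagreement grows with the size of the variable.
raw_differences = [b - a for a, b in duplicates]

# On the logarithmic scale the disagreement is the same at every level.
log_differences = [math.log10(b) - math.log10(a) for a, b in duplicates]
```

The raw differences are 1, 10 and 100, while the logarithmic differences are all equal to log 1.1, about 0.041.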
CHAPTER III
SIGNIFICANCE  OF  MEANS 

(a)  Significance  of  a  Single  Mean 

Suppose we have a population distributed with standard deviation $\sigma$ about
its mean $\bar{X}$. Suppose we take samples of n individuals at a time. The first
sample will have a mean $\bar{x}_1$, the second sample will have a mean $\bar{x}_2$ not necessarily
the same as $\bar{x}_1$, and so on. It is a valuable property of these sample means that
they are distributed about the population mean $\bar{X}$ with standard deviation $\sigma/\sqrt{n}$.
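
This property of sample means can be verified exactly for a small case by listing every possible sample. The Python sketch below assumes a hypothetical population of five values and samples of three drawn with replacement; the standard deviations are taken with the population divisor, since every possible sample is enumerated.

```python
import math
from itertools import product
from statistics import mean, pstdev

population = [1, 2, 3, 4, 5]   # hypothetical population
sigma = pstdev(population)     # population standard deviation

n = 3
# Mean of every possible ordered sample of n individuals.
sample_means = [mean(sample) for sample in product(population, repeat=n)]

sd_of_sample_means = pstdev(sample_means)
predicted = sigma / math.sqrt(n)
```

The standard deviation of the 125 sample means equals the predicted value, and their average equals the population mean of 3.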

Now if we take an individual at random and find that it lies further from
the population mean $\bar{X}$ than $1.96\sigma$ then we conclude that this individual is not
very likely to belong to the population we are considering. In exactly the same
way if we take a sample of n individuals and find that its mean is more than
$1.96\sigma/\sqrt{n}$ from the population mean $\bar{X}$ then we conclude that it is not very
likely that this sample of n individuals was drawn from the population we are
considering.

The ratio of the deviation of the mean $\bar{x}$ of a particular sample of n individuals
from an expected value E to its standard deviation $\sigma/\sqrt{n}$ is known as Student's t,
i.e.

$$t = \frac{\bar{x} - E}{\sigma/\sqrt{n}}$$

We  will  deal  with  a  specific  example.  Two  analysts  A  and  B  analysed  a 
series  of  chemical  mixtures,  and  obtained  the  results  below  for  the  percentage  of 
one  ingredient.  We  would  expect  the  differences  between  the  operators  for  each 
mixture  to  average  zero  if  there  was  no  systematic  difference  between  the  operators. 
The  essential  point  about  this  example  is  that  we  have  ten  numbers  (the  last 
row  in  Table  3.1)  and  we  wish  to  test  whether  their  mean  differs  significantly 
from  zero. 

Table 3.1

Mixture        1    2    3    4    5    6    7    8    9   10

Analyst A      7    9    8   10    8   11    9    8    9    8
Analyst B     11    7   10   10    9   10   10    9   11   11

A - B = x     -4    2   -2    0   -1    1   -1   -1   -2   -3

The average value $\bar{x}$ for A - B is -1.10, and the expected value E is 0.

We calculate the standard deviation of these individuals as

$$\sqrt{[(-4)^2 + 2^2 + \ldots + (-3)^2 - (-4 + 2 + \ldots - 3)^2/10]/(10 - 1)} = 1.79.$$

The standard deviation of the mean $\bar{x}$ is given by $\sigma/\sqrt{n} = 1.79/\sqrt{10} = 0.565$.
We then calculate t as 1.10/0.565 = 1.95.

Now the value we used for $\sigma$ is of course only an estimate of the population
value. Being based on ten individuals it is not as accurate as if it were based on
one hundred or one thousand individuals. The table of the significant values of
t takes this into account (Table I in the Appendix). Examining this Table, we
see that for infinite degrees of freedom (the degrees of freedom here are
numerically one less than the number of observations) a value of t as large as 1.96 should
only occur 1 in 20 times (P = 0.05) if the hypothesis being tested is true. Here
the number of observations is 10, so the degrees of freedom are 9. For the
5% level of significance we would need a value of t of 2.26. Our present value
is 1.95 and this is less than the value for the 5% level but is greater than the value
1.83 for the 10% level. The significance of the present result is thus between
10% and 5%, i.e. our hypothesis that there is no difference between the two
analysts is in reasonable accord with the data. If our value of t had been a
little larger, however, we would have had to regard the hypothesis as incorrect
and to accept the conclusion that there almost certainly was a genuine difference
between the analysts.
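
The whole of this test can be retraced in a few lines of Python. The differences are those of Table 3.1 and the tabulated values of t (2.26 and 1.83 for 9 degrees of freedom) are those quoted in the text; the function and variable names are illustrative assumptions.

```python
import math

differences = [-4, 2, -2, 0, -1, 1, -1, -1, -2, -3]  # A - B, Table 3.1


def single_mean_t(values, expected=0.0):
    """Return the mean, standard deviation, standard error and Student's t."""
    n = len(values)
    mean = sum(values) / n
    variance = sum((v - mean) ** 2 for v in values) / (n - 1)
    standard_error = math.sqrt(variance / n)
    t = (mean - expected) / standard_error
    return mean, math.sqrt(variance), standard_error, t


mean_diff, sd_diff, se_diff, t_value = single_mean_t(differences)

T_5_PERCENT_9_DF = 2.26    # from the table of t, as quoted in the text
T_10_PERCENT_9_DF = 1.83

significant_at_5_percent = abs(t_value) > T_5_PERCENT_9_DF
significant_at_10_percent = abs(t_value) > T_10_PERCENT_9_DF
```

Carried through without intermediate rounding, t comes out at about 1.94 in absolute value, against the 1.95 obtained above from the rounded standard error; the conclusion is unchanged.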

(b)  Confidence  Limits  for  a  Single  Mean 

We have shown that the mean of this quantity (A - B) is not
significantly  different  from  zero.  This  attainment  of  non-significance  might  be 
due  to  the  quantity  not  differing  from  zero,  but  alternatively  it  might  be  due  to 
the  inadequacy  of  our  data,  i.e.  our  data  may  be  insufficient  to  give  us  a  reasonably 
accurate  estimate  of  the  quantity.  We  can  estimate  which  of  these  two  alternatives 
is  operating  as  follows. 

We  are  going  to  determine  the  limits  between  which  we  can  be  reasonably 
confident  that  the  true  value  of  the  mean,  as  determined  from  our  sample,  lies. 

The  first  requirement  is  to  decide  what  we  mean  by  "reasonably  confident." 
We  might  consider  that  if  we  are  right  19  out  of  20  times  (equivalent  to  95%) 
we  would  be  satisfied. 

We would then look up in the table of t for 9 degrees of freedom its value
for the 5% level (95% chance of being right is equivalent to a 5% chance of being
wrong). Here t is 2.26. Then the limits ± L on either side of the sample mean,
where $\sigma$ is the standard deviation of the population and n is the number of
individuals in the sample, are

$$\pm L = \pm t(\sigma/\sqrt{n}) = \pm 2.26 \times 0.565 = \pm 1.28.$$

Thus it is reasonably certain (within that chosen probability level) that the true
value of the mean lies between -1.10 + 1.28 = 0.18 and -1.10 - 1.28 = -2.38.
These limits are known as 95% confidence limits. (1)

If  19  out  of  20  was  not  sufficiently  certain  for  us,  we  might  choose  999  out 
of  1000  (99.9%)  and  find  from  the  table  for  t  the  value  for  9  degrees  of  freedom 
and the 0.001 level as 4.78. The limits would then be -1.10 ± 4.78 × 0.565
or from -3.80 to 1.60.
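
Both sets of limits follow from the same small calculation, sketched here in Python. The mean, standard error and values of t are those quoted in the text; the function name is an illustrative assumption.

```python
def confidence_limits(mean, standard_error, t_value):
    """Lower and upper limits: mean -/+ t times the standard error."""
    half_width = t_value * standard_error
    return mean - half_width, mean + half_width


# Mean of A - B and its standard error, as found above.
limits_95 = confidence_limits(-1.10, 0.565, 2.26)    # 5% level, 9 d.f.
limits_99_9 = confidence_limits(-1.10, 0.565, 4.78)  # 0.1% level, 9 d.f.
```

To two decimal places these are -2.38 to 0.18 and -3.80 to 1.60, as stated.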

We  now  consider  the  actual  width  of  these  limits  (the  95%  or  99.9%,  or 
whatever  level  we  have  chosen).  They  fix  the  accuracy  with  which  we  have 
determined  the  mean.  If  the  limits  are  narrow  our  determination  is  accurate, 
if  they  are  wide  the  determination  is  not  very  accurate.  These  considerations 
should indicate whether the non-significant result we obtained was due to the
real equality of the mean with the expected value or due to the inadequacy of the
data. Here, where the mean has a 95% chance of lying anywhere between 0.18
and -2.38, it would seem that our determination is rather inaccurate. (2)

Greater accuracy in a mean can be obtained by taking more observations,
i.e. by increasing n, since the standard deviation of the mean (or standard error)
is $\sigma/\sqrt{n}$. Doubling the number of observations will give limits approximately
$1/\sqrt{2} = 0.707$ of the present ones. Quadrupling the numbers would give limits
$1/\sqrt{4} = 0.500$ of the present ones.
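
The narrowing of the limits with more observations can be tabulated with a one-line function, sketched here in Python. It accounts only for the change in the standard error; the accompanying small change in the tabulated value of t is ignored, which is why the text says "approximately". The function name is an illustrative assumption.

```python
import math


def relative_width_of_limits(multiple_of_observations):
    """Width of the limits relative to the present ones when the number of
    observations is multiplied by the given factor (t taken as unchanged)."""
    return 1.0 / math.sqrt(multiple_of_observations)


doubled = relative_width_of_limits(2)
quadrupled = relative_width_of_limits(4)
```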

(1) Strictly speaking, these are "fiducial limits" and not confidence limits. For practical
purposes the distinction is so slight as to be not worth making, and because of its greater
intelligibility the term confidence limit is used here.

(2) A more accurate statement would be "If we say that the true population mean lies
between 0.18 and -2.38, then the probability of our being correct is 0.95."
(c) Comparison of Two Means

We  obtain  two  sets  of  numbers,  and  wish  to  test  whether  the  mean  of  one 
set  is  significantly  different  from  the  mean  of  the  other  set,  or  whether  the  two 
sets  can  be  regarded  as  drawn  from  one  population. 

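Let $x_1, x_2, \ldots, x_{n_1}$ and $x'_1, x'_2, \ldots, x'_{n_2}$

be the two samples.

We calculate the following terms. $\Sigma$ denotes summation.

$$\bar{x} = \frac{\Sigma(x)}{n_1}, \qquad \bar{x}' = \frac{\Sigma(x')}{n_2}$$

$$\sigma^2 = \left[\Sigma(x^2) - \frac{(\Sigma x)^2}{n_1} + \Sigma(x'^2) - \frac{(\Sigma x')^2}{n_2}\right] / (n_1 + n_2 - 2)$$

$$t = \frac{\bar{x} - \bar{x}'}{\sigma} \sqrt{\frac{n_1 \times n_2}{n_1 + n_2}}$$
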

Having found the value of t, we enter the table of t with degrees of freedom
$(n_1 + n_2 - 2)$. If our value of t exceeds that for the 5% level we may take it
with that degree of assurance that the populations are different.

Let  us  consider  a  specific  example.  The  numbers  below  represent  the 
throughputs  in  hundreds  of  tons  of  acid  before  failure  for  two  sets  of  pots,  one 
from  Foundry  A,  the  other  from  Foundry  B,  used  in  sulphuric  acid  pot  con- 
centration. Is  there  any  reason  to  suppose  that  one  foundry  is  better  than  the 
other  ? 


Foundry A:  71, 67, 33, 79, 42
Foundry B:  73, 80

$$\bar{x}_A = (71 + 67 + 33 + 79 + 42)/5 = 292/5 = 58.4$$

$$\bar{x}_B = (73 + 80)/2 = 153/2 = 76.5$$

$$\sigma^2 = [(71^2 + 67^2 + \ldots + 42^2 - 292^2/5) + (73^2 + 80^2 - 153^2/2)]/(5 + 2 - 2) = 1596/5 = 319.2$$

$$\sigma = 17.9$$

$$t = \frac{76.5 - 58.4}{17.9} \sqrt{\frac{5 \times 2}{5 + 2}} = 1.21 \text{ for } (5 + 2 - 2) = 5 \text{ degrees of freedom.}$$

Consulting  the  table  of  t,  we  find  that  for  5  degrees  of  freedom  t  must  be 
not  less  than  2.57  to  be  significant  at  the  5%  level.  Accordingly  there  is  no 
evidence  that  here  the  populations  are  different. 
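
The comparison of the two foundries can be retraced in Python as follows. The data and the tabulated value 2.57 are those of the text; the function and variable names are illustrative assumptions.

```python
import math

foundry_a = [71, 67, 33, 79, 42]
foundry_b = [73, 80]


def two_sample_t(first, second):
    """Student's t for two means, with a pooled estimate of the variance."""
    n1, n2 = len(first), len(second)
    mean1, mean2 = sum(first) / n1, sum(second) / n2
    # Sums of squares about each sample's own mean.
    ss1 = sum(x * x for x in first) - sum(first) ** 2 / n1
    ss2 = sum(x * x for x in second) - sum(second) ** 2 / n2
    degrees_of_freedom = n1 + n2 - 2
    pooled_sd = math.sqrt((ss1 + ss2) / degrees_of_freedom)
    t = (mean2 - mean1) / pooled_sd * math.sqrt(n1 * n2 / (n1 + n2))
    return t, degrees_of_freedom, pooled_sd


t_value, degrees_of_freedom, pooled_sd = two_sample_t(foundry_a, foundry_b)

T_5_PERCENT_5_DF = 2.57   # from the table of t, as quoted in the text
significant = abs(t_value) > T_5_PERCENT_5_DF
```

The result is t = 1.21 on 5 degrees of freedom, well short of the 2.57 required.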

Looking  up  this  value  of  t  in  a  more  complete  table  of  t  than  the  one  in  the 
Appendix,  we  find  it  corresponds  to  a  level  of  significance  of  approximately  25%. 
That is to say, in the absence of any difference between the two foundries, if we
repeatedly  sampled  from  them  we  would  get  as  large  an  apparent  difference 
between  them  as  we  did  here  1  out  of  every  4  times.  The  present  result,  therefore, 
is  only  a  very  slight  indication  of  the  possible  superiority  of  Foundry  B. 

We  have  shown  that  the  difference  in  these  means  is  not  significantly  greater 
than  zero.    We  can  further  lay  down  the  confidence  limits  within  which  the 
difference will have a chosen probability of lying. We proceed on the same lines
as  before. 

We have that the standard error of the difference in the means of the two
samples, denoted by $\sigma_m$, is

$$\sigma_m = \sigma \sqrt{\frac{n_1 + n_2}{n_1 \times n_2}} = 17.9 \times \sqrt{\frac{5 + 2}{5 \times 2}} = 14.9$$

We also have t = its value in the table of t corresponding to $(n_1 + n_2 - 2)$ =
(5 + 2 - 2) = 5 degrees of freedom and to the significance level chosen. If we
select the 5% level of significance (95% chance of being correct in the prediction
that the true value of the difference in the means of the two samples lies between
the limits stated) then for 5 degrees of freedom, t = 2.57.

Then  if  ±  L  is  the  maximum  deviation  likely  (at  this  significance  level)  we 
have 

L  =  tom  =  2.57  x  14.9  =  38.3 

The actual mean value obtained for the difference between the two types of
pot was 18.1. The true value will lie, in 95% of cases, within ± 38.3 of this,
i.e. from −20.2 to +56.4. We thus see what a poor comparison is afforded by
the data cited. There may well be marked differences between the foundries,
but the present data is quite inadequate to answer these queries.
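
The arithmetic of this section can be retraced with a short Python sketch. It works only from the summary figures quoted above; the variable names are illustrative, the mean of 58.4 is inferred from the stated difference of 18.1, and the value of t is taken from the table rather than computed. Because the pooled standard deviation is entered already rounded to 17.9, the results agree with the text only to within rounding.

```python
import math

# Summary figures quoted in the text for the two foundries.
mean_a, n_a = 58.4, 5   # 58.4 is inferred from the stated difference of 18.1
mean_b, n_b = 76.5, 2
pooled_sd = 17.9        # pooled standard deviation, 5 degrees of freedom
t_5pct = 2.57           # tabulated t, 5 degrees of freedom, 5% level

difference = mean_b - mean_a

# Standard error of the difference between the two sample means.
std_error = pooled_sd * math.sqrt((n_a + n_b) / (n_a * n_b))

# Student t for the observed difference.
t_value = difference / std_error

# Confidence limits: observed difference plus or minus t times the standard error.
half_width = t_5pct * std_error
lower, upper = difference - half_width, difference + half_width

print(f"t = {t_value:.2f} on {n_a + n_b - 2} degrees of freedom")
print(f"standard error = {std_error:.2f}")
print(f"95% limits for the difference: {lower:.1f} to {upper:.1f}")
```

Since the limits include zero, the sketch leads to the same verdict as the t test: the data do not establish a difference.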

(d)  Conclusions 

In general, therefore, in any account of experimental work involving the
determination of a mean, in addition to the value obtained, its standard deviation
and the number of observations upon which it is based should also be given.
This will enable readers to assess the level of significance of the result.

An  important  proviso  in  the  calculation  of  a  standard  deviation  is  that  the 
individuals  should  be  independent.  Thus  if  we  made  10  batches  of  some  material, 
and  took  duplicate  samples  from  each  batch,  we  could  not  calculate  the  standard 
deviation  on  the  assumption  that  we  made  20  observations.  Clearly  the  duplicate 
observations  on  each  batch  are  not  independent  of  each  other.  The  methods  of 
dealing  with  this  type  of  situation  are  discussed  later  (Chapter  XII  (j)). 


CHAPTER  IV 
COMPARISON  OF  VARIANCES 

(a)  Comparison  of  Two  Variances 

When  we  wish  to  compare  the  means  of  two  groups  of  numbers  we  use  the 
Student  t  test.  It  is  sometimes  the  case,  however,  that  what  we  wish  to  compare 
is  the  spreads  or  variabilities  of  the  two  sets  of  figures,  and  this  is  a  different 
problem. 

The  test  of  significance  for  this  type  of  problem  is  due  to  Fisher  (Fisher 
actually  dealt  with  the  natural  logarithm  of  the  ratio  of  the  square  roots  of  the 
variances,  which  he  called  z,  but  here  we  will  use  the  simple  variance  ratio  which 
is  denoted  by  F). 

To deal with an example, suppose we have two alternative methods of technical
chemical analysis. One or both of the methods may have a systematic bias,
but that does not matter, for all we require is a method that will give a reproducible
result.

Table 4.1 gives the results of 6 analyses by Method A on a sample and 7
analyses by Method B on the same sample.

Table 4.1

Method  A. 

95.6         94.9         96.2         95.1         95.8         96.3 

Method  B. 

93.3         92.1         94.7         90.1         95.6         90.0         94.7 

The averages for Methods A and B are 95.65 and 92.93 respectively. If we
wished to test the significance of this apparent difference we would use the
Student t test. Here we are comparing the variances, which we proceed to
calculate, subtracting 90 from every observation to simplify the arithmetic :

$$\sigma_A^2 = \left[(5.6^2 + 4.9^2 + \dots + 6.3^2) - \frac{(5.6 + 4.9 + \dots + 6.3)^2}{6}\right] / (6 - 1) = (193.15 - 191.53)/5 = 0.324$$

$$\sigma_B^2 = \left[(3.3^2 + 2.1^2 + \dots + 4.7^2) - \frac{(3.3 + 2.1 + \dots + 4.7)^2}{7}\right] / (7 - 1) = (90.85 - 60.04)/6 = 5.14$$

We now calculate the ratio of the larger variance to the smaller, termed the
variance ratio or F. Attached to the variance ratio are two sets of degrees of
freedom, n₁ for the larger variance and n₂ for the smaller.

Here the variance ratio is 5.14/0.324 = 15.9 for degrees of freedom n₁ = 6,
n₂ = 5. In the Appendix are tables for the variance ratio for a comprehensive
series of values of n₁ and n₂ for the 20%, 5%, 1% and 0.1% levels of significance.
If our value of the variance ratio is greater than that given in a particular table
for the corresponding degrees of freedom, then our result is more significant than
the significance level for the table.

Thus for degrees of freedom n₁ = 6, n₂ = 5, for the 20%, 5%, 1% and
0.1% levels of significance the values of the variance ratio are 2.2, 5.0, 10.7 and
28.8 respectively. Our value of the variance ratio was 15.9 : its significance thus
lies between the 1% and the 0.1% levels (but see below).

That is to say, assuming our hypothesis to be correct we would be very
unlikely to get the result we did. Accordingly it is very probable that our
hypothesis is incorrect. Our hypothesis was that the two sets of figures were drawn
from the same population, i.e. the variabilities of the two methods were the same.
Our hypothesis having been rejected, we can assert that the variabilities
of the two methods do differ significantly.

A point of some subtlety arises here. The variance tables were constructed
for the purpose of testing whether a variance A was greater than a variance B,
A  being  known  beforehand  as  derived  in  a  particular  way  which  distinguished  it 
from  B.  Here  we  are  testing  whether  a  variance  A  is  greater  than  a  variance  A1, 
and  until  we  have  calculated  them  we  do  not  know  which  is  A  and  which  is  A1 : 
A  is  that  one  which  turns  out  to  be  the  larger.  The  effect  of  this  distinction  is 
to  alter  the  level  of  significance  of  the  20%,  5%,  1%,  and  0.1%  tables  to  40%, 
10%,  2%,  and  0.2%  respectively. 
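
As a check on the arithmetic of this section, the following Python sketch computes the two variances from Table 4.1 and forms the variance ratio. The critical values are those quoted above from the Appendix tables, not computed; the variable names are illustrative. The doubling of the tabulated levels reflects the point just made, that we did not know beforehand which variance would be the larger.

```python
import statistics

# Table 4.1: repeat analyses of one sample by each method.
method_a = [95.6, 94.9, 96.2, 95.1, 95.8, 96.3]
method_b = [93.3, 92.1, 94.7, 90.1, 95.6, 90.0, 94.7]

# statistics.variance divides the sum of squares by (number of observations - 1).
var_a = statistics.variance(method_a)
var_b = statistics.variance(method_b)

# The variance ratio always has the larger variance on top.
variance_ratio = max(var_a, var_b) / min(var_a, var_b)

# Tabulated critical values for n1 = 6, n2 = 5, as quoted in the text.
critical = {0.20: 2.2, 0.05: 5.0, 0.01: 10.7, 0.001: 28.8}

# The tabulated level is doubled because either variance could have been the larger.
levels_reached = [2 * level for level, value in critical.items()
                  if variance_ratio > value]

print(f"variances: A = {var_a:.3f}, B = {var_b:.2f}")
print(f"variance ratio = {variance_ratio:.1f}")
print(f"most stringent level reached = {min(levels_reached):.0%}")
```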

(b)  Averaging  of  Several  Variances 

Suppose,  extending  our  previous  example,  we  had  a  number  of  determinations 
of  variability  for  each  method  on  a  number  of  different  samples.  Suppose  a 
second sample analysed by Method B gave results 93.1, 91.2 and 92.6. Its variance
would then (using the revised zero by subtracting 90 from every observation) be
[3.1² + 1.2² + 2.6² − (3.1 + 1.2 + 2.6)²/3]/(3 − 1) = (17.81 − 6.9²/3)/2
= 1.94/2 = 0.97.

To form the average of the two variances, we weight them in accordance
with their degrees of freedom. Thus the average would be

$$\frac{2 \times 0.97 + 6 \times 5.14}{2 + 6} = 4.10$$

This average is the same as obtained by pooling the sums of squares and
degrees of freedom for the two samples, i.e. (30.82 + 1.94)/(6 + 2).

This  is  the  correct  method  for  obtaining  an  average  variance.     It  is  incorrect 

(a)  to  take  the  average  of  the  variances  without  regard  for  the  degrees  of 
freedom  ; 

(b)  to  take  the  average  of  the  standard  deviations. 

If  we  have  the  variances  expressed  as  standard  deviations,  we  should  square 
them  to  convert  them  to  variances,  and  take  the  average  as  indicated. 

The  average  variance,  correctly  obtained,  can  be  regarded  as  having  as  its 
degrees  of  freedom  the  sum  of  the  degrees  of  freedom  of  the  separate  variances 
going  to  make  up  that  average.  Thus  in  the  present  example  the  degrees  of 
freedom  of  the  average  variance  are  (6  +  2)  =  8. 

In  parenthesis,  it  might  be  pointed  out  that  in  the  calculation  of  the  average 
variance  of  the  two  samples,  the  two  variances  must  be  calculated  separately  and 
then pooled. It would be incorrect to pool all the 7 + 3 = 10 observations
together  and  calculate  their  variance,  because  the  variance  so  obtained  includes 
not  only  the  variance  of  repeat  measurements  on  the  same  sample  (which  is  what 
we  require)  but  also  the  variance  between  samples,  which  is  quite  irrelevant. 
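
The weighting by degrees of freedom can be set out as a short Python sketch. The function name `pooled_variance` is illustrative; it simply pools the sums of squares and the degrees of freedom of the separate samples, as described above.

```python
import statistics

# Two sets of repeat analyses by Method B, each on a different sample.
sample_1 = [93.3, 92.1, 94.7, 90.1, 95.6, 90.0, 94.7]
sample_2 = [93.1, 91.2, 92.6]

def pooled_variance(samples):
    """Average several variances, weighting each by its degrees of freedom."""
    total_sum_of_squares = 0.0
    total_df = 0
    for sample in samples:
        df = len(sample) - 1
        # variance * df recovers the sum of squares about the sample mean
        total_sum_of_squares += statistics.variance(sample) * df
        total_df += df
    return total_sum_of_squares / total_df, total_df

average_variance, degrees_of_freedom = pooled_variance([sample_1, sample_2])
print(f"average variance = {average_variance:.2f} "
      f"on {degrees_of_freedom} degrees of freedom")
```

Each variance is calculated about its own sample mean before pooling, so the difference between the two samples does not enter the result.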

(c)  Comparison  of  Several  Variances 

The Fisher variance ratio test will compare two variances, but not more
than  two.  Suppose  we  have  a  group  of  ten  machines  turning  out  batches  of 
some  product,  and  we  measure  some  quality  x  on  each  batch.  Suppose  we  suspect 
that  some  machines  manufacture  the  product  more  regularly  than  others,  i.e.  that 
there  is  less  variability  in  their  product.  The  test  of  significance  for  this  type  of 
problem  is  generally  known  as  Bartlett's  test. 

Column 2 in the table below represents the value of the property x for six
consecutive batches from each of machines 1 to 10. The figures in column 2 have
been transformed by shifting the zero to simplify the arithmetic : originally each
was larger by 2200.


Machine   Data                      Σ(x)    Σ(x²)   (Σ(x))²/6   Σ(x − x̄)²     σ²      logₑ(σ²)

   1      28, 14, 15, 2, 9, 2        70     1294      816.7       477.3      95.46     4.5587
   2      11, 3, 0, 9, 2, 12         37      359      228.2       130.8      26.16     3.2642
   3      7, 0, −1, −1, 0, 1          6       52        6.0        46.0       9.20     2.2192
   4      8, 0, 3, −1, 6, −1         15      111       37.5        73.5      14.70     2.6878
   5      10, 5, 3, 0, 0, 1          19      135       60.2        74.8      14.96     2.7054
   6      4, 8, 7, 0, −1, 1          19      131       60.2        70.8      14.16     2.6502
   7      11, 2, 3, 11, 10, 0        37      355      228.2       126.8      25.36     3.2331
   8      15, 10, 0, 8, 10, 10       53      589      468.2       120.8      24.16     3.1847
   9      7, 7, 4, 9, 5, 6           38      256      240.7        15.3       3.06     1.1184
  10      10, 2, −1, 10, −1, 7       27      255      121.5       133.5      26.70     3.2847

Totals                                                                      253.92    28.9064

The columns headed Σ(x), Σ(x²), (Σ(x))²/6 and Σ(x − x̄)², where Σ denotes
summation, are merely the steps in calculating the variances for the ten machines.

The latter are given in the column headed σ².

We  now  wish  to  test  the  hypothesis  that  these  ten  variances  could  reasonably 
have  been  obtained  from  the  same  population,  i.e.  that  the  batches  from  the  ten 
machines are of the same variability.

We proceed by obtaining (from tables) the natural logarithm of each variance
(last column), and summing these. Let us denote this sum by Σ(logₑ σᵢ²). We
take the mean of the variances as 253.92/10 = 25.392 : we can denote this mean
variance as S². We take its natural logarithm logₑ S², here 3.2343. Let k equal
the number of variances being compared (here 10) and n equal the degrees of
freedom of these individual variances (here 5). Then we calculate two terms
B and C as

$$B = kn \log_e S^2 - n\,\Sigma(\log_e \sigma_i^2) = 10 \times 5 \times 3.2343 - 5 \times 28.9064 = 17.18$$

and

$$C = 1 + \frac{k + 1}{3nk} = 1 + \frac{10 + 1}{3 \times 10 \times 5} = 1.073$$

If B/C is greater than χ² at the 5% level of significance for (k − 1) degrees
of freedom, then there is an indication at that level of significance that the variances
are not drawn from a homogeneous population.

Here B/C = 17.18/1.073 = 16.0 with (10 − 1) = 9 degrees of freedom.
For 9 degrees of freedom for the 10%, 5% and 2% levels of significance χ², or
here B/C, needs to reach values of 14.68, 16.92 and 19.68 respectively. Here
our present value does not reach the 5% level, but is near it, and there is some
indication that the machines do differ in the variability of their
product.
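
The steps of the test, for the case treated here in which every variance has the same number of degrees of freedom, can be sketched in Python as follows. The function name `bartlett_equal_df` is illustrative, and the critical values of χ² are those quoted above rather than computed. Because the sketch carries the variances unrounded, B may differ from the hand calculation in the last figure.

```python
import math
import statistics

# Property x for six consecutive batches from each of the ten machines
# (zero shifted by 2200, as in the table).
machines = [
    [28, 14, 15, 2, 9, 2],
    [11, 3, 0, 9, 2, 12],
    [7, 0, -1, -1, 0, 1],
    [8, 0, 3, -1, 6, -1],
    [10, 5, 3, 0, 0, 1],
    [4, 8, 7, 0, -1, 1],
    [11, 2, 3, 11, 10, 0],
    [15, 10, 0, 8, 10, 10],
    [7, 7, 4, 9, 5, 6],
    [10, 2, -1, 10, -1, 7],
]

def bartlett_equal_df(groups):
    """Return B, C and B/C for groups that all have the same size."""
    k = len(groups)            # number of variances compared
    n = len(groups[0]) - 1     # degrees of freedom of each variance
    variances = [statistics.variance(g) for g in groups]
    mean_variance = sum(variances) / k
    b = k * n * math.log(mean_variance) - n * sum(math.log(v) for v in variances)
    c = 1 + (k + 1) / (3 * n * k)
    return b, c, b / c

b, c, ratio = bartlett_equal_df(machines)
print(f"B = {b:.2f}, C = {c:.3f}, B/C = {ratio:.1f}")

# Critical values of chi-squared for 9 degrees of freedom, as quoted in the text.
print("exceeds 10% value:", ratio > 14.68, " exceeds 5% value:", ratio > 16.92)
```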

It  should  be  clear  that  if  the  largest  and  smallest  variances  of  a  set  do  not 
differ  significantly  as  tested  by  the  variance  ratio  test,  then  clearly  those  lying 
between  cannot  differ  significantly  either,  and  the  whole  set  can  be  reasonably 
regarded  as  coming  from  a  single  population.  In  these  circumstances  there  would 
be  no  need  to  employ  the  Bartlett  Test. 

A more general form of Bartlett's test, for the case where the several variances
being  compared  have  different  degrees  of  freedom,  is  available,  but  will  not  be 
discussed  here. 

(d)  Confidence  Limits  for  Variances 

In the same way as we laid down confidence limits for means, we can lay
down  confidence  limits  for  variances. 

(i)  Small  Samples 

If we wish to calculate the probable lower limit for the true variance (say φ),
at say the 5% level of significance, we calculate

$$\varphi = \frac{n\sigma^2}{\chi^2}$$

where σ² is the variance as calculated from our sample with n degrees of freedom,
and χ² has its value corresponding to n degrees of freedom and the 5% level of
significance.

Thus suppose we have a variance σ² of 10.0 units, based on a sample of ten
individuals (nine degrees of freedom), then for the 5% level of significance
χ² = 16.92 and

$$\varphi = \frac{9}{16.92} \times 10.0 = 5.32$$

This is the lower limit. The corresponding upper limit is given by the 95% level
of χ², for which χ² for 9 degrees of freedom is 3.32, whence

$$\varphi = \frac{9}{3.32} \times 10.0 = 27.1$$

Regarding these limits together, we have a 5% probability of the true population
variance being less than 5.32 and a 5% probability of its exceeding 27.1. The
total probability of the true value being outside these limits is thus (5% + 5%)
= 10%.

Speaking generally, therefore, our confidence limits should be regarded as
10% and not 5% ; if we wish to have limits nearer 5% we can use the 98% and
2% levels of χ² giving us confidence limits of 4%.
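
A minimal Python sketch of the small-sample calculation follows. The function name `variance_limits` and its parameter names are illustrative; the two values of χ² are read from the table for 9 degrees of freedom, as in the text, and are not computed here.

```python
def variance_limits(sample_variance, df, chi2_upper_tail, chi2_lower_tail):
    """Limits for the true variance from tabulated chi-squared values.

    chi2_upper_tail: tabulated value exceeded with small probability (e.g. 5%)
    chi2_lower_tail: tabulated value exceeded with large probability (e.g. 95%)
    """
    lower = df * sample_variance / chi2_upper_tail
    upper = df * sample_variance / chi2_lower_tail
    return lower, upper

# Variance of 10.0 from ten individuals (nine degrees of freedom).
lower, upper = variance_limits(10.0, 9, chi2_upper_tail=16.92, chi2_lower_tail=3.32)
print(f"limits for the true variance: {lower:.2f} to {upper:.1f}")
```

The large value of χ² gives the lower limit and the small value the upper limit, which is why the two tabulated levels appear to be interchanged.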

(ii)  Large  Samples 

For larger samples (greater than 30 individuals) we can use the fact that the
standard deviation σ_σ of a standard deviation σ is

$$\sigma_\sigma = \frac{\sigma}{\sqrt{2n}}$$

where n is the number of observations.

Thus if we have a sample of 50 individuals whose standard deviation is 10.0,
the standard deviation of the sample standard deviation is 10.0/√(2 × 50) = 1.0.
The 95% confidence limits are thus 10.0 ± 2.01 × 1.0 (2.01 is the value of t for
the 5% level of significance and 50 degrees of freedom).
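
The large-sample rule amounts to a two-line calculation, sketched below in Python. The function name `sd_limits_large_sample` is illustrative, and the value 2.01 is the tabulated t quoted in the text.

```python
import math

def sd_limits_large_sample(sd, n, t_value):
    """Limits for a standard deviation from a large sample of n observations."""
    sd_of_sd = sd / math.sqrt(2 * n)   # standard deviation of the standard deviation
    return sd - t_value * sd_of_sd, sd + t_value * sd_of_sd, sd_of_sd

low, high, sd_of_sd = sd_limits_large_sample(10.0, 50, 2.01)
print(f"standard deviation of the standard deviation = {sd_of_sd:.1f}")
print(f"95% limits: {low:.2f} to {high:.2f}")
```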


CHAPTER  V 
THE χ² TEST

(a)  Introduction 

We  now  turn  to  a  different  type  of  problem  which  requires  a  technique 
different from those previously discussed. The χ² test is applied in general to
those  problems  in  which  we  wish  to  determine  whether  the  frequency  with  which 
a  given  event  has  occurred  is  significantly  different  from  that  which  we  expected. 

The  technique  is  applied,  therefore,  to  such  phenomena  as  the  occurrence 
of  heads  or  tails  in  the  spinning  of  a  coin  :  the  frequency  of  bursting  of  drums 
in  transit :  the  numbers  of  accidents  on  different  shifts  :  the  number  of  defective 
castings  made  by  different  methods. 

To  apply  the  method  we  calculate  the  expected  frequencies  of  the  events  on 
the  basis  of  the  theory  we  are  testing,  and  we  then  test  whether  the  observed 
frequency  differs  significantly  from  the  expected.  If  it  does,  of  course,  then  our 
theory  is  unlikely  to  be  correct :  if  it  does  not,  then  our  theory  is  in  reasonable 
accordance  with  the  data,  and  the  differences  can  be  due  to  random  error. 

The statistic χ² is defined as

$$\chi^2 = \Sigma\,\frac{(O - E)^2}{E}$$

where O is the observed class frequency, E is the expected class frequency, and
Σ denotes summation over all the classes.

In general, it is inadvisable to use the χ² test where any expected class
frequency is less than 5. This point is discussed later in Chapter V (h).

We  will  now  discuss  a  series  of  examples. 

(b) The 1 × 2 Table

Suppose  we  tossed  a  coin  100  times,  and  we  obtained  60  heads  and  40  tails. 
Could we say that the coin was definitely biassed ?

Here we have two classes, heads and tails, and on the hypothesis which we
are testing the expectation E in each class is 50.

We have to make a correction for the fact that our actual distribution is
discontinuous whereas χ² is a continuous variable. This correction for
continuity, as it is called, consists in reducing by 0.5 the values which are greater
than expectation and increasing by 0.5 those which are less. We thus have

$$\chi^2 = \frac{(59.5 - 50)^2}{50} + \frac{(40.5 - 50)^2}{50} = 3.61$$

The  appropriate  number  of  degrees  of  freedom  is  found  as  the  number  of 
classes  to  which  values  can  be  assigned  arbitrarily.  Here,  given  the  total,  we 
can  only  fill  one  class  arbitrarily,  because  when  that  is  done  the  second  class  is 
automatically  determined,  i.e.  if  we  make  100  tosses  and  obtain  60  heads,  then 
the number of tails must be 40. Accordingly, entering the table of χ² with one
degree of freedom, we see that to reach the 5% level of significance χ² would have
to be equal to or greater than 3.84. Hence our result does not reach significance
at the 5% level, and so we cannot assert that the coin is definitely biassed.
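
The calculation for the coin can be sketched in Python as below. The function name `chi_square_corrected` is illustrative; it moves each observed frequency 0.5 nearer to its expectation before squaring, which is the correction for continuity described above. The critical value 3.84 is the tabulated figure quoted in the text.

```python
def chi_square_corrected(observed, expected):
    """Chi-squared with each deviation reduced by 0.5 (correction for continuity)."""
    total = 0.0
    for o, e in zip(observed, expected):
        # Reduce the absolute deviation by 0.5, but never below zero.
        deviation = max(abs(o - e) - 0.5, 0.0)
        total += deviation ** 2 / e
    return total

chi2 = chi_square_corrected([60, 40], [50, 50])
significant_at_5pct = chi2 >= 3.84   # tabulated value for 1 degree of freedom

print(f"chi-squared = {chi2:.2f}, significant at 5%: {significant_at_5pct}")
```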

It  might  perhaps  be  stressed  that  the  probability  we  have  derived  is  not  that  of 
getting  60  heads  exactly,  but  that  of  getting  either  60  or  more  heads  or  60  or  more 
tails. The probability of getting exactly 60 heads with an unbiassed coin is

$$\frac{100!}{60!\,40!} \times \frac{1}{2^{100}} = 0.010\,843\,867$$

(this at first sight rather formidable calculation is made easy by the tables of
powers and of factorials contained in Barlow's Tables of Squares, etc. (E. & F. N.
Spon)). The probability of getting exactly 61 heads is

$$\frac{100!}{61!\,39!} \times \frac{1}{2^{100}} = 0.007\,110\,733$$

Proceeding in this manner, we can get the series for 62, 63, etc., heads, and
summing this series we get for the sum of probabilities 0.028 443 968. This is the
probability of getting 60 or more heads. Adding the similar probability of getting
60 or more tails, the total probability is 0.056 887 936, or 5.69%.

The χ² test gives the probability, using a detailed table of χ² for 1 degree
of freedom (1), as 5.8%. The two methods are thus in close agreement.
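
With a computer the exact calculation no longer needs tables of factorials. The sketch below uses Python's exact integer arithmetic (`math.comb`) to form each term and to sum the tail; the function name `prob_heads` is illustrative.

```python
from math import comb

def prob_heads(k, n=100):
    """Probability of exactly k heads in n tosses of an unbiassed coin."""
    return comb(n, k) / 2 ** n

p60 = prob_heads(60)
p61 = prob_heads(61)

# Probability of 60 or more heads, then doubled to include 60 or more tails.
upper_tail = sum(prob_heads(k) for k in range(60, 101))
two_sided = 2 * upper_tail

print(f"P(exactly 60 heads) = {p60:.9f}")
print(f"P(exactly 61 heads) = {p61:.9f}")
print(f"P(60 or more heads) = {upper_tail:.6f}")
print(f"P(60 or more of either face) = {two_sided:.4f}")
```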

(c) The χ² Table

It will be noted that the χ² Table (Table II in the Appendix) is given for
values of P greater than 0.10, namely 0.50, and 0.90, 0.95, 0.98, and 0.99.

The latter values are those which occur when a fit is better than it should
be. It may seem surprising that this could be so. However, consider the case
of a trial of 1,000 tosses to decide whether a coin is biassed. Our expectation
for each of the two classes is 500. If we got exactly 500 of each in our trial,
this agreement would be remarkable. Thus suppose we repeated the experiment
a large number of times, then in general we should expect to get observed values
of the frequencies corresponding to values of χ² corresponding to values of P
scattered over the range 0.95 to 0.05, with 1 out of 20 outside the upper limit
and 1 out of 20 outside the lower limit. The value of χ² which turns up most
frequently is that corresponding to P = 0.50. Close fits should only occur with
that frequency given by the χ² test, and if they occurred more frequently we might
reasonably suspect that the data was being "cooked."

(1) Such as that in "An Introduction to the Theory of Statistics" : G. U. Yule and
M. G. Kendall (Charles Griffin), Appendix Table 4a.

(d) The 1 × n Table

An example of a 1 × n table, in which n = 3, is below.

Shift                                  A      B      C

Accidents in a particular period       1      7      7

We formulate the hypothesis that there is no association of number of accidents
with shift, and proceed to test it with the χ² test. On our hypothesis the
expectations for the three classes are 5, 5, and 5 respectively.

$$\chi^2 = \frac{(5 - 1)^2}{5} + \frac{(7 - 5)^2}{5} + \frac{(7 - 5)^2}{5} = 4.80$$

The degrees of freedom are two (given the total, we can fill in two classes
arbitrarily but then the third class is uniquely determined). Entering the χ²
table, we find that for 2 degrees of freedom, to reach the 5% level of significance
χ² has to be equal to or greater than 5.99. Since our value of χ² is less than
that, our hypothesis is reasonable, i.e. there are insufficient grounds for asserting
that the three shifts differ in their liability to accidents.

This  result  will  probably  be  surprising  to  the  non-statistician,  who  almost 
certainly  would  have  assumed  that  it  was  quite  certain  that  the  three  shifts  did 
differ.  It  is  important  to  note  that  the  statistician  does  not  assert  that  the  three 
shifts  do  not  differ  :  he  merely  states  that  the  evidence  that  they  differ  is  inadequate. 
Of  course,  we  might  formulate  the  hypothesis  that  the  accidents  are  in  the  ratio 
1:7:7; χ² would then be zero, i.e. there is no evidence that the accidents depart
from  the  ratio  1:7:7.  It  is  thus  apparent  that  the  data  is  simply  inadequate  to 
draw  any  conclusion. 

Suppose we waited for a further period, and the accidents in the total period
were 2, 14, and 14 respectively. The table would then become

Shift                          A      B      C

Accidents in total period      2     14     14

Again  testing  our  hypothesis  that  there  is  no  association  of  accidents  with 
shift,  the  total  number  of  accidents  is  30,  and  on  this  hypothesis  the  expectation 
E  for  each  category  is  10. 

Accordingly

$$\chi^2 = \frac{(2 - 10)^2}{10} + \frac{(14 - 10)^2}{10} + \frac{(14 - 10)^2}{10} = 9.60$$

Such a value of χ² with 2 degrees of freedom is more significant than the
1% level, i.e. on our hypothesis such a value would have been obtained less than
1 in 100 times. Thus the additional evidence has now made it very probable
that the shifts do differ in their liability to accident.
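
The two accident tables can be compared side by side with a short Python sketch. The function name `chi_square_equal_expectation` is illustrative; it assumes, as the hypothesis does, that the total is expected to divide equally among the classes. The critical values are the tabulated figures for 2 degrees of freedom (5.99 quoted above; 9.21 for the 1% level).

```python
def chi_square_equal_expectation(counts):
    """Chi-squared for a 1 x n table when every class has the same expectation."""
    expected = sum(counts) / len(counts)
    return sum((c - expected) ** 2 / expected for c in counts)

first_period = chi_square_equal_expectation([1, 7, 7])
total_period = chi_square_equal_expectation([2, 14, 14])

print(f"first period: chi-squared = {first_period:.2f}")
print(f"total period: chi-squared = {total_period:.2f}")

# Tabulated values for 2 degrees of freedom: 5.99 (5% level), 9.21 (1% level).
print("first period reaches 5% level:", first_period >= 5.99)
print("total period reaches 1% level:", total_period >= 9.21)
```

The proportions are identical in the two tables; doubling every frequency doubles χ², and it is this that carries the result past the 1% level.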

Table 5.1 gives some data obtained on a process for beating paper in a beater.
Four alternative methods of loading the beater were tried, the numbers of cycles
carried out for the four methods being given by the first row in Table 5.1. The
second row gives the number of chokes of paper between the bed-plate and the
roll that occurred.

Table 5.1

Method of Loading        A       B       C       D     Total

Number of cycles         8      10       9      13       40
Chokes occurring         5       8       9      10       32
Chokes expected        6.4     8.0     7.2    10.4       32

We wish to test the hypothesis that there is no association of frequency of
choking with method of loading, i.e. that frequency of choking is independent of
method of loading. On this hypothesis the expectation of a choke, the same for
all methods, is given by 32/40 = 0.80 chokes per cycle. The third row in
Table 5.1 gives the expectation for the four methods. We have therefore

$$\chi^2 = \frac{1.4^2}{6.4} + \frac{0^2}{8.0} + \frac{1.8^2}{7.2} + \frac{0.4^2}{10.4} = 0.77$$

We have used one degree of freedom in making the restriction that our expected
total is equal to the actual total. (If we had an a priori expectation, e.g. if we
had the hypothesis that there was one choke per cycle, then we would not have
lost this degree of freedom.) Entering the table for χ² with 3 degrees of freedom,
we see that to reach the 5% level of significance we would have to have a χ² of
7.82. It is clear that the present data is in excellent accord with the hypothesis
that the frequency of choking is independent of the method of loading.

It  will  be  noted  that  Table  5.1  at  first  sight  is  a  2  X  n  table  (see  Section  (f)). 
In  a  2  X  n  table  the  frequencies  being  compared  relate  to  the  same  general  class 
of  events.  Here,  however,  the  occurrence  of  a  cycle  is  not  of  the  same  class  as 
the  occurrence  of  a  choke.  Accordingly  we  are  concerned  with  the  1  X  n  table 
of  chokes  occurring  and  only  use  the  other  row  to  derive  our  expected  frequencies. 
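
A short Python sketch shows how the row of cycles serves only to supply the expectations, while χ² is summed over the row of chokes alone. The dictionary names are illustrative.

```python
# Table 5.1: cycles run and chokes observed for each method of loading.
cycles = {"A": 8, "B": 10, "C": 9, "D": 13}
chokes = {"A": 5, "B": 8, "C": 9, "D": 10}

# Common expectation of a choke per cycle on the hypothesis of no association.
rate = sum(chokes.values()) / sum(cycles.values())

# Expected chokes for each method: the common rate times its number of cycles.
expected = {method: rate * cycles[method] for method in cycles}

chi2 = sum((chokes[m] - expected[m]) ** 2 / expected[m] for m in cycles)

# One degree of freedom is lost because the expected total is made equal
# to the observed total.
df = len(cycles) - 1

print(f"rate = {rate:.2f} chokes per cycle")
print("expected:", {m: round(e, 1) for m, e in expected.items()})
print(f"chi-squared = {chi2:.2f} on {df} degrees of freedom")
```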

(e) The 2 × 2 Table

A  problem  frequently  occurring  can  be  reduced  to  the  following  form  : — 

One  type  of  process  produces  (a)  defectives  in  a  sample  of  (a  +  b)  individuals  : 
another  type  of  process  produces  (c)  defectives  in  a  sample  of  (c  +  d)  individuals. 
Are  the  two  populations  the  same  or  different  ? 

This  type  of  problem  can  occur,  for  example,  in  cordite  manufacture,  where 
the  defectives  might  be  sticks  rejected  for  e.g.  surface  blemishes,  and  we  are 
interested in whether, e.g., one method of packing the press cylinder gives fewer
rejects than another.

The  problem  is  formally  exactly  the  same  when  articles  are  being  tested  with 
go-not  go  gauges  instead  of  continuously  measuring  gauges  ;  if  an  article  fails  to 
pass  on  one  or  the  other  of  the  gauges  it  is  a  defective,  if  it  passes  the  gauges  it 
is  a  non-defective. 

To  take  an  example,  suppose  that  in  our  first  sample  of  1000  there  were 
100  defectives  (fraction  defective  10%)  and  in  the  second  sample  of  500  there 
were  60  defectives  (fraction  defective  12%).  Are  we  justified  in  taking  it  as 
reasonably  certain  that  the  populations  represented  by  these  two  samples  are 
different  ? 

The data can be assembled in the form below where it is sometimes referred
to as a "contingency table."

                 Defective        Non-Defective     Totals

Process A        a = 100          b = 900           a + b = 1000
Process B        c =  60          d = 440           c + d =  500

Totals           a + c = 160      b + d = 1340      1500

The  hypothesis  we  are  going  to  test  is  that  there  is  no  association  between 
the  occurrences  of  defective  articles  and  the  type  of  process.  On  this  hypothesis, 
we  shall  calculate  the  expectation  for  each  cell  of  the  table.  To  consider  the  first 
cell,  a,  this  refers  to  defectives,  and  the  probability  of  a  defective  is  given  by 
160/1500.  It  also  refers  to  process  A,  and  the  probability  of  any  individual 
belonging  to  process  A  is  1000/1500.  The  probability  of  any  individual  falling 
into cell a is therefore (160/1500) (1000/1500) and the actual expectation is this
probability times the total number, i.e.

$$\frac{160}{1500} \times \frac{1000}{1500} \times 1500 = \frac{160 \times 1000}{1500} = 106.7$$


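Similarly, the expectations for b, c, and d are

$$\frac{1340 \times 1000}{1500} = 893.3, \quad \frac{160 \times 500}{1500} = 53.3, \quad \text{and} \quad \frac{1340 \times 500}{1500} = 446.7$$

Accordingly

$$\chi^2 = \frac{6.7^2}{106.7} + \frac{6.7^2}{893.3} + \frac{6.7^2}{53.3} + \frac{6.7^2}{446.7} = 1.4$$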


For  one  degree  of  freedom  (given  the  marginal  totals,  we  can  only  fill  in 
one  cell  arbitrarily,  for  then  the  others  are  uniquely  determined)  this  is  about 
the  20%  level  of  significance,  i.e.  the  apparent  superiority  of  process  A  is  not 
established  by  this  experiment. 

We  omitted  in  the  above  example  to  apply  the  correction  for  continuity : 
with  large  numbers  such  as  are  in  the  above  example  this  is  permissible,  for  the 
resulting  difference  is  only  slight. 

In the above example we calculated the cell expectations separately, and
summed the series of terms for χ². There is, however, a convenient mechanical
method, which merely involves calculating

$$\chi^2 = \frac{(ad - bc)^2 \times N}{(a + b)(b + d)(c + d)(a + c)}$$

where a, b, c, and d have the meanings as in the contingency table quoted above
and N = a + b + c + d. Thus in the above example

$$\chi^2 = \frac{(100 \times 440 - 900 \times 60)^2 \times 1500}{160 \times 1340 \times 500 \times 1000} = 1.4$$

as before.
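
That the two routes agree can be verified with a short Python sketch. The function names `chi_square_cells` and `chi_square_shortcut` are illustrative; the first forms each cell expectation from the marginal totals, the second applies the mechanical formula directly. Neither applies the correction for continuity.

```python
def chi_square_cells(a, b, c, d):
    """Chi-squared for a 2 x 2 table, summing (O - E)^2 / E over the four cells."""
    n = a + b + c + d
    row_totals = (a + b, c + d)
    col_totals = (a + c, b + d)
    observed = ((a, b), (c, d))
    total = 0.0
    for i in range(2):
        for j in range(2):
            expected = row_totals[i] * col_totals[j] / n
            total += (observed[i][j] - expected) ** 2 / expected
    return total

def chi_square_shortcut(a, b, c, d):
    """The same quantity from the mechanical formula."""
    n = a + b + c + d
    return (a * d - b * c) ** 2 * n / ((a + b) * (b + d) * (c + d) * (a + c))

# Process A: 100 defective, 900 non-defective; Process B: 60 and 440.
by_cells = chi_square_cells(100, 900, 60, 440)
by_shortcut = chi_square_shortcut(100, 900, 60, 440)

print(f"cell by cell: {by_cells:.3f}   shortcut: {by_shortcut:.3f}")
```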

With  small  sample  sizes  it  is  essential  to  apply  the  correction  for  continuity, 
as  in  the  following  example. 

A  pressing  process  using  one  batch  of  powder  gave  9  defective  pellets  out 
of  45  pressed  (20%  defective)  and  with  a  second  batch  of  powder  29  defectives 
out of 87 pressed (33% defective).

                   Batch A      Batch B      Totals

Non-Defective      58 = a       36 = b         94
Defective          29 = c        9 = d         38

Totals               87           45          132

As before, the method of correcting for continuity is to add ½ to the observation
in those cells which are below expectation and subtract ½ from those above
expectation. The marginal totals are, of course, unchanged by this operation.

$$\chi^2 = \frac{(58.5 \times 9.5 - 28.5 \times 35.5)^2 \times 132}{94 \times 45 \times 38 \times 87} = 1.96$$

For 1 degree of freedom this is quite non-significant. If we had neglected
to apply the correction for continuity we would have obtained a value for χ² of
2.57, in this case still non-significant. In borderline cases, however, the difference
may be important (2).
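
The effect of the correction on the pellet data can be checked with the Python sketch below. The function name `chi_square_2x2` is illustrative. Rather than adjusting the four cells individually, it reduces |ad − bc| by N/2, which is algebraically the same thing when the marginal totals are held fixed; this form is a convenience of the sketch and is not the layout used in the text.

```python
def chi_square_2x2(a, b, c, d, continuity=True):
    """Chi-squared for a 2 x 2 table, optionally corrected for continuity."""
    n = a + b + c + d
    cross = abs(a * d - b * c)
    if continuity:
        # Moving every cell 1/2 nearer its expectation lowers |ad - bc| by n/2.
        cross = max(cross - n / 2, 0.0)
    return cross ** 2 * n / ((a + b) * (b + d) * (c + d) * (a + c))

# Non-defective: 58 and 36; defective: 29 and 9.
corrected = chi_square_2x2(58, 36, 29, 9, continuity=True)
uncorrected = chi_square_2x2(58, 36, 29, 9, continuity=False)

print(f"with correction: {corrected:.2f}   without: {uncorrected:.2f}")
```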


(f) The 2 × n Table

The data in Table 5.2 below refers to two types of incident (two particular types
of breakdown, A and B) which can occur in a certain process. The process can
be run on three grades of raw material, L, M, and N.

Table 5.2

              L       M       N     Totals

A            42      13      33       88
B            20       8      25       53

Totals       62      21      58      141

We  form  the  hypothesis  that  there  is  no  association  of  type  of  incident  with 
raw  material.  In  other  words,  that  the  relative  likelihood  of  incidents  is  the  same 
for  the  three  types  of  raw  material.  We  proceed  to  test  this  hypothesis  with  the 
χ² test.

The expectation for the cell L A is 62 × 88/141 = 38.7 and the contribution
to χ² is (42 − 38.7)²/38.7 = 0.28. Similarly, the expectation for the cell M A
is 21 × 88/141, etc.

The degrees of freedom for n₁ × n₂ tables are (n₁ − 1) × (n₂ − 1), or here
(2 − 1) (3 − 1) = 2. The value of χ² comes out, summing for all six cells, as
1.5, which is very much less significant than the 5% level. Accordingly we can
conclude that the data is in excellent accord with our hypothesis.

Non-statistically,  we  would  have  calculated  the  ratios  A/B  for  raw  material 
classes  L,  M,  and  N  as  42/20  =  2.10,  13/8  =  1.63,  and  33/25  =  1.32.  We 
would  probably  have  concluded  that  these  ratios  were  significantly  different.  The 
use of the χ² test, however, shows that such a conclusion would be unwarranted.

(2) An interesting discussion of how faulty conclusions can easily be reached from 2 × 2 tables is given by J. Berkson, using hospital data: Biometrics Bulletin, Vol. 2, No. 3, page 47, 1946.

As for a 2 × 2 table, so for a 2 × n table there is a convenient mechanical method for computing $\chi^2$. Suppose the frequencies in pairs of cells are a and A, b and B, etc., and the corresponding totals n and N. Then we calculate for each pair

$$p = a/(a + A) \quad \text{and} \quad ap = a^2/(a + A)$$

and from the totals

$$P = n/(n + N)$$

and then

$$\chi^2 = \frac{\sum(ap) - nP}{P(1 - P)}$$

Our three pairs give as values of ap

$$42^2/62 = 28.452, \quad 13^2/21 = 8.048, \quad \text{and} \quad 33^2/58 = 18.776$$

$\sum(ap)$ is then 55.276

and $P = 88/141 = 0.624$
and $nP = 88^2/141 = 54.922$

Whence $\chi^2 = (55.276 - 54.922)/(0.624 \times 0.376) = 1.51$ as before.
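As a check on the mechanical method, the following minimal Python sketch evaluates $(\sum(ap) - nP)/(P(1 - P))$ directly. The function name `chi_squared_2_by_n` is hypothetical, chosen here for illustration; the counts are again those of Table 5.2.

```python
def chi_squared_2_by_n(first_row, second_row):
    """Chi-squared for a 2 x n table by the shortcut (sum(ap) - nP) / (P(1 - P))."""
    n = sum(first_row)        # total of the first row (n in the text)
    big_n = sum(second_row)   # total of the second row (N in the text)

    # For each pair of cells a and A, ap = a^2 / (a + A).
    sum_ap = sum(a * a / (a + b) for a, b in zip(first_row, second_row))

    p_total = n / (n + big_n)  # P in the text
    return (sum_ap - n * p_total) / (p_total * (1 - p_total))


chi2_shortcut = chi_squared_2_by_n([42, 13, 33], [20, 8, 25])
print(f"chi-squared by the shortcut = {chi2_shortcut:.2f}")
```

The shortcut needs only one division per column, which is why it was convenient for desk calculation; algebraically it gives the same value as summing the six cell contributions.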


(g) The $n_1 \times n_2$ Table

An $n_1 \times n_2$ table, with $n_1 = 3$, $n_2 = 6$, is given below.

            a     b     c     d     e     f    Total
A          11    23     8     5    18    18      83
B          17    29    10    17     7    15      95
C           6    21     8    24    15     9      83
Totals     34    73    26    46    40    42     261

A  product  was  being  produced  on  three  shifts,  A,  B,  and  C  and  was  classified 
into  six  grades,  the  order  of  quality  being  a  to  f. 

On  the  average  each  of  the  three  shifts  would  be  supplied  with  raw  material 
of  the  same  average  quality.  The  question  arose  as  to  whether  any  shift  was 
tending  to  produce  better  quality  material  than  the  others,  i.e.  was  there  a  relative 
excess  of  particular  qualities  on  any  one  shift. 

To test the hypothesis that there is no association of quality with shift, we calculate $\chi^2$ for this table. The expectation of the cell Aa is $83 \times 34/261 = 10.8$, and the contribution to $\chi^2$ is $(11 - 10.8)^2/10.8 = 0.004$. Similarly the expectation for the cell Ba is $95 \times 34/261 = 12.38$, and the contribution to $\chi^2$ is $(17 - 12.38)^2/12.38 = 1.72$.

Performing the operation for all the cells and summing all the contributions, we get for $\chi^2$ the value 26.3. The degrees of freedom with which we enter the $\chi^2$ table is given by the number of independent ways the contingency table can be filled up, given the marginal totals. For an $n_1 \times n_2$ contingency table this is $(n_1 - 1)(n_2 - 1)$, here $5 \times 2 = 10$. Entering the $\chi^2$ table for 10 degrees of freedom with the value 26.3 for $\chi^2$ we have the probability of getting so large a value as less than 0.01 (i.e. 1%). Accordingly the hypothesis that there is no association of grade with shift would seem to be incorrect, i.e. there is an association of quality and shift.

The $\chi^2$ test, should it give significance to the association between the two variables, does not give any information as to the type of association. This can only be found by inspection of the original contingency table. To assist in this it is convenient to enter in each cell, alongside the observed frequency, the expected frequency. Doing this for the present example, we find that A shift were producing a relative excess of e and f, B shift of a and C shift of d.
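Setting the expected frequency beside each observed frequency is easily mechanised. The sketch below is an illustration only: the variable names are assumptions made here, and the rule used to pick out "the largest relative excess" (the cell with the largest $\chi^2$ contribution among those where observed exceeds expected) is one reasonable reading of the text, not a rule stated in it.

```python
shifts = ["A", "B", "C"]
grades = ["a", "b", "c", "d", "e", "f"]
observed = [[11, 23, 8, 5, 18, 18],
            [17, 29, 10, 17, 7, 15],
            [6, 21, 8, 24, 15, 9]]

row_totals = [sum(row) for row in observed]
col_totals = [sum(col) for col in zip(*observed)]
grand_total = sum(row_totals)

chi2 = 0.0
largest_excess = {}  # shift -> (grade, observed, expected)
for shift, row, r_tot in zip(shifts, observed, row_totals):
    best = None
    for grade, obs, c_tot in zip(grades, row, col_totals):
        exp = r_tot * c_tot / grand_total
        contribution = (obs - exp) ** 2 / exp
        chi2 += contribution
        print(f"{shift}{grade}: observed {obs:2d}, expected {exp:5.1f}")
        # Track the excess (observed > expected) with the largest contribution.
        if obs > exp and (best is None or contribution > best[3]):
            best = (grade, obs, exp, contribution)
    largest_excess[shift] = best[:3]

print(f"chi-squared = {chi2:.1f}")
print(largest_excess)
```

The printed list makes the pattern visible at a glance: the cells with large differences between observed and expected are the ones responsible for the significant value of $\chi^2$.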

(h)  Restriction  of  Expected  Cell  Frequency  to  be  not  less  than  5 

As mentioned at the beginning of this discussion of $\chi^2$, it is not safe to use the $\chi^2$ test where any expected class frequency is less than 5. Sometimes this
means  that  the  test  cannot  be  applied.  On  other  occasions  the  difficulty  can  be 
evaded. 

Consider the table below:

            L1    L2     M    N1    N2    Total
A            5    37    13    28     5      88
B            3    17     8    20     5      53
Totals       8    54    21    48    10     141

It represents the data from which Table 5.2 in section (f) was built up. The expectations for cells L1B and N2B are both less than 5. As the table stands, therefore, the $\chi^2$ test cannot be applied. However, if we pool the data in columns L1 and L2 and in columns N1 and N2, we get the 2 × 3 table quoted in section (f), Table 5.2, which is perfectly satisfactory.
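The pooling step can be sketched as follows. The helper names `expected_counts` and `pool_columns` are hypothetical, introduced here only for illustration; the choice of which columns to pool is the one made in the text.

```python
def expected_counts(table):
    """Expected frequency of each cell: row total x column total / grand total."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    return [[r * c / grand_total for c in col_totals] for r in row_totals]


def pool_columns(table, groups):
    """Add together the columns listed in each group, e.g. [[0, 1], [2], [3, 4]]."""
    return [[sum(row[i] for i in group) for group in groups] for row in table]


# Columns L1, L2, M, N1, N2; rows A and B.
unpooled = [[5, 37, 13, 28, 5],
            [3, 17, 8, 20, 5]]

smallest_before = min(min(row) for row in expected_counts(unpooled))

# Pool L1 with L2 and N1 with N2, leaving M alone.
pooled = pool_columns(unpooled, [[0, 1], [2], [3, 4]])
smallest_after = min(min(row) for row in expected_counts(pooled))

print(f"smallest expectation before pooling: {smallest_before:.2f}")
print(f"smallest expectation after pooling:  {smallest_after:.2f}")
```

Pooling should combine classes that it is sensible to treat alike (here, sub-grades of the same raw material); the code only checks the arithmetic condition, not whether the pooling is physically reasonable.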


CHAPTER VI
THE POISSON DISTRIBUTION

(a)  Introduction 

Consider  a  manufacturing  process  in  which  an  operation  is  being  repeated 
a  great  many  times  each  day,  and  occasionally  an  inflammation  occurs.  If  we 
set  out  in  time  sequence  the  daily  number  of  inflammations  we  may  get  such  a 
set  of  figures  as 

0 0 1 0 0 2 0 3 0 0 0 1 3 0 1 4 0 0 0 2

Some days will be without any, others will have a relatively large number. The question we will discuss is whether the probability of obtaining an inflammation is constant throughout. Prima facie we might consider that the days with 0 or 1 incidents have low probabilities of obtaining an inflammation and the days with 3, 4 or more high probabilities. If this were the case, then we could look for assignable causes of variation in the probability, i.e. find out in what way the days with high numbers of inflammations differed from days with low numbers, with a view to correcting the manufacturing practice. On the other hand, if the probability were really constant throughout we should be wasting our time trying to find out in what ways the days differed.

If  we  assume  that  the  probability  of  obtaining  an  incident  at  any  one  instant 
is  small  but  finite  and  constant,  then  over  a  finite  period  the  chance  of  obtaining 
an incident will be appreciable. The incident can be of any type (we are considering spontaneous ignitions in a manufacturing process) and can include the occurrence of defective articles in their mass production. In this latter case we must pay due regard to the proviso that the probability of obtaining an incident at any one instant (in the terminology of this example, of obtaining a defective on selecting any particular article) is small. In practice the approximation is good enough if the percentage defective is less than 10%.

(b)  Number  of  Incidents  per  Interval 

To  revert  to  our  example  of  spontaneous  inflammation,  let  us  count  up  the 
number  of  days  on  which  no  incident  occurs,  on  which  1  incident  occurs,  on 
which  2  incidents  occur,  etc.  Then  if  our  hypothesis  is  correct,  the  relative 
frequencies  will  be  as  in  Table  6.1  below  (the  e  is  the  base  of  natural  logarithms, 
m  is  the  average  frequency,  and  e.g.  4  !  is  factorial  4,  i.e.  4  X  3  x  2  x  1). 

Table 6.1

Number of incidents     Relative
in one day              frequency
0                       $e^{-m}$
1                       $m e^{-m}/1!$
2                       $m^2 e^{-m}/2!$
3                       $m^3 e^{-m}/3!$
4                       $m^4 e^{-m}/4!$
etc.                    etc.

This series of terms is known as the Poisson series.
The sum of the relative frequencies is unity, for it is

$$e^{-m} + m e^{-m}/1! + m^2 e^{-m}/2! + m^3 e^{-m}/3! + \dots$$

$$= e^{-m}\left[1 + m/1! + m^2/2! + m^3/3! + \dots\right]$$

$$= e^{-m} \times e^{m} = 1$$

The first two columns in Table 6.2 summarise the results observed on 201 days' working of an industrial process. The incident was an inflammation that would occur in a repetition process that was being repeated a large number of times each day. Over this period of 201 days there were 150 incidents, and they were distributed as in Table 6.2. Thus there were 102 days without any incidents, 59 days with 1 incident, 31 days with 2 incidents, etc. The total number of days is given by

$$102 + 59 + 31 + 8 + 0 + 1 = 201$$

and the total number of incidents is given by

$$102 \times 0 + 59 \times 1 + 31 \times 2 + 8 \times 3 + 0 \times 4 + 1 \times 5 = 150$$

Table 6.2

Number of incidents    Number of occurrences    Expectation of
on one day             of this type             occurrences
0                      102                      95.3
1                       59                      71.1
2                       31                      26.5
3                        8 }                     6.6
4                        0 }  9
5                        1 }
6                        0 }
Total                  201                     201.0

We have 150 incidents in 201 days, so the daily average frequency m is 0.7463. A table of natural logarithms gives us $e^{m}$ as 2.109; $e^{-m}$ is the reciprocal of this, namely 0.4742. The initial term of the series, corresponding to 0 incidents in a day, is $Ne^{-m} = 201 \times 0.474 = 95.3$. The next term is $mNe^{-m}/1! = 0.7463 \times 95.3 = 71.1$. The next term is $m^2Ne^{-m}/2! = 0.7463 \times 71.1/2 = 26.5$, and so on. These Poissonian expectations are given in the last column of Table 6.2.

To test the significance of the departure of the observed frequencies from the expected frequencies we use the $\chi^2$ test in the usual way, as for a 1 × n table. Thus the first cell's contribution to $\chi^2$ is

$$\frac{(102 - 95.3)^2}{95.3} = 0.47$$

and similarly for the other cells. In applying the $\chi^2$ test it is essential to pool sufficient cells at the bottom end of the table to satisfy the condition that the expected frequency in any cell should not be less than 5.

A further essential condition in applying the $\chi^2$ test is that we should take as the degrees of freedom (n − 2), where n is the number of cells remaining after pooling. The reason for losing 2 degrees of freedom, as compared with 1 in the usual 1 × n table, is that we have not only made the totals agree (as in the 1 × n table, this taking up 1 degree of freedom) but also used one constant in fitting the shape of the distribution, this latter taking the second degree of freedom.

In the above example we obtain a value of 3.4 for $\chi^2$ with 2 degrees of freedom. This corresponds to a probability of 15%, so the fit can be considered satisfactory (P would have to be 5% or less before we could consider the hypothesis of agreement between the observed and expected distribution disproved).
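The whole fitting procedure (average frequency, successive terms of the series, pooling of the tail, and $\chi^2$ with n − 2 degrees of freedom) can be sketched in Python. This is a sketch under stated assumptions, not a definitive routine: the function name `poisson_fit_chi2` and the parameter `min_expected` are invented here, and the pooling rule is simplified to pooling the tail only, with the pooled cell taking whatever expectation is left so that the totals agree.

```python
import math


def poisson_fit_chi2(observed, min_expected=5.0):
    """Fit a Poisson series to observed[k] = number of intervals with k incidents.

    Returns (m, expected, chi2, dof). The tail is pooled until its
    expectation reaches min_expected (a simplification: only the tail
    is examined).
    """
    n_intervals = sum(observed)
    m = sum(k * f for k, f in enumerate(observed)) / n_intervals

    # Successive terms: N e^-m, then multiply by m/1, m/2, m/3, ...
    expected = [n_intervals * math.exp(-m)]
    for k in range(1, len(observed)):
        expected.append(expected[-1] * m / k)

    # Pool from the bottom of the table upwards.
    cut = len(observed) - 1
    while cut > 0 and n_intervals - sum(expected[:cut]) < min_expected:
        cut -= 1
    obs_cells = observed[:cut] + [sum(observed[cut:])]
    exp_cells = expected[:cut] + [n_intervals - sum(expected[:cut])]

    chi2 = sum((o - e) ** 2 / e for o, e in zip(obs_cells, exp_cells))
    dof = len(obs_cells) - 2  # one for the total, one for the fitted mean
    return m, expected, chi2, dof


# Days with 0, 1, 2, ... 6 incidents, as in Table 6.2.
m, expected, chi2, dof = poisson_fit_chi2([102, 59, 31, 8, 0, 1, 0])
print(f"m = {m:.4f}")
print("expectations:", [round(e, 1) for e in expected[:4]])
print(f"chi-squared = {chi2:.1f} on {dof} degrees of freedom")
```

Building each term from the previous one (multiplying by m and dividing by the number of incidents) mirrors the hand calculation in the text and avoids computing factorials.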

It  will  be  noted  that  the  shape  of  the  Poisson  distribution  depends  on  the 
value  of  m,  the  average  frequency.  If  m  is  less  than  1,  as  in  the  previous  example, 
then  each  term  must  be  less  than  the  preceding  one,  and  the  first  term  will  be 
the  largest. 

If  m  is  greater  than  1  but  less  than  2,  then  the  second  term  is  larger  than 
the  first,  since  it  is  equal  to  the  first  multiplied  by  m  and  divided  by  1.  It  will 
also  be  greater  than  the  third,  so  it  is  the  maximum  term  of  the  series.  If  m 
is  greater  than  2,  but  less  than  3,  then  the  third  term  will  be  larger  than  the  second 
and  larger  than  the  fourth. 
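The behaviour just described follows from the ratio of successive terms. Writing $T_k$ for the term corresponding to k incidents (a symbol introduced here only for this illustration), we have

$$\frac{T_k}{T_{k-1}} = \frac{m^k e^{-m}/k!}{m^{k-1} e^{-m}/(k-1)!} = \frac{m}{k}$$

so the terms increase as long as k is less than m and decrease once k exceeds m. For example, with m = 2.2537,

$$\frac{T_1}{T_0} = 2.25, \quad \frac{T_2}{T_1} = 1.13, \quad \frac{T_3}{T_2} = 0.75$$

and the third term (k = 2) is therefore the largest.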

Table  6.3  below  is  derived  from  the  same  data  as  the  previous  example.  The 
interval  taken  was  3  days,  so  the  average  frequency  is  now  2.2537. 

Table 6.3

Number of incidents in    Number of occurrences    Expectation of
periods of three days     of this type             occurrences
0                          8                        7.04
1                         20                       15.86
2                         13                       17.87
3                         12                       13.42
4                          6                        7.56
5                          5 }                      3.41 }
6                          2 }                      1.28 }
7                          0 }  8.0                 0.41 }  5.24
8                          1 }                      0.11 }
9                                                   0.03 }

It will be observed that the expected frequencies correspond closely to the observed frequencies, $\chi^2 = 4.49$ for 4 degrees of freedom, this corresponding to a probability of 40%.

(c)  Distribution  of  Time  Intervals 

We  have  considered  the  number  of  incidents  per  time  interval.  A  slightly 
different  way  of  looking  at  the  matter  is  to  consider  the  size  of  interval  between 
each  incident.  The  interval  can  range  from  zero  up  to  ten  days  or  more.  If 
we  assume  that  the  probability  of  obtaining  an  incident  is  constant  throughout, 
we can predict the relative frequencies of the various sizes of intervals. Let m be the reciprocal of the average interval, and N the total number of intervals observed. Then the number of intervals lying between $t_1$ and $t_2$ would be given by

$$N(e^{-mt_1} - e^{-mt_2})$$

where e is the base of natural logarithms.

Table  6.4  gives  the  distribution  of  time  intervals,  derived  from  the  same  data 
as  the  previous  table. 

Table 6.4

Intervals (hours)    Number observed    Number expected
0-7.9                 48                 44.46
8-15.9                36                 34.63
16-23.9               27                 26.95
24-31.9               24                 21.00
32-39.9               10                 16.44
40-47.9               13                 12.64
48-55.9               10                 10.01
56-63.9                8                  7.64
64-71.9                9                  6.03
72-79.9                3                  4.76
80 to infinity        13                 16.45
Total                201                201.01


The computation for this table is easier if we take 8 hours as a unit of time. Using a finer distribution of intervals than is shown in Table 6.4, the average interval was found to be 3.9950 units and m, its reciprocal, 0.2503.

The expectation for the first interval is then

$$N(e^{-mt_1} - e^{-mt_2}) = 201(e^{-0.2503 \times 0} - e^{-0.2503 \times 1}) = 201(1 - 0.7788) = 44.46$$

The expectation for the second interval is then

$$N(e^{-mt_2} - e^{-mt_3}) = 201(e^{-0.2503 \times 1} - e^{-0.2503 \times 2}) = 201(0.7788 - 0.6065) = 34.63$$

The complete expectations are given in the last column of Table 6.4.

The closeness of fit can be tested with the $\chi^2$ test. Here a value of 6.13 is obtained which with (11 − 2) = 9 degrees of freedom corresponds to a probability of 75%. The fit is thus excellent.
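The expectations for all the classes of interval can be generated in one pass. The sketch below assumes the classes are bounded at 0, 1, 2, ... 10 units of 8 hours, with the last class running to infinity; the function name `expected_intervals` is hypothetical. Values computed at full precision may differ slightly in the second decimal from figures worked with four-figure tables.

```python
import math


def expected_intervals(n_intervals, m, boundaries):
    """Expected numbers of intervals in each class [t_i, t_(i+1)), plus the open tail."""
    expected = [
        n_intervals * (math.exp(-m * lower) - math.exp(-m * upper))
        for lower, upper in zip(boundaries, boundaries[1:])
    ]
    # Last class: from the final boundary to infinity.
    expected.append(n_intervals * math.exp(-m * boundaries[-1]))
    return expected


boundaries = list(range(11))  # 0, 1, ..., 10 units of 8 hours
expected = expected_intervals(201, 0.2503, boundaries)

for lower, value in zip(boundaries, expected):
    print(f"from {8 * lower:2d} hours: {value:6.2f}")
print(f"total = {sum(expected):.2f}")
```

Because each class takes the difference of two exponentials, the expectations necessarily add up to N, which provides a simple check on the arithmetic.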

The time distribution of incidents when expressed in the present form is rather curious in that the commonest interval is the shortest.

It is interesting to note that the two methods test rather different aspects of the distribution. A large number of long periods (say 4 days or more) would show up in the present method as an excessive number in the last category, and since the expected number in this category is small it would not require many to give a significantly large contribution to $\chi^2$. The first method described, however, would record this in the category of days with zero incidents, and since this is frequently a large class the additional number would have to be very large to make this cell give a significantly large contribution to $\chi^2$. Clearly a number of days with zero incidents is more remarkable if they occur as a series consecutively than if they are interspersed with days with incidents, and the latter treatment is more sensitive to this form of departure from the Poisson distribution. On the other hand, the former treatment is more sensitive to the occurrence of a large number of incidents in a short time.


CHAPTER  VII 
THE  ANALYSIS   OF  VARIANCE 

(a)  Introduction 

It is a valuable property of variance that if a process has a number of factors each making a contribution to the variance of the final product, then this total variance is equal to the sum of the component variances (1).

This  statement  is  less  obvious  than  it  may  seem.  Thus,  if  we  were  using 
standard  deviations  as  our  measure  of  variability,  it  would  not  be  true  to  say  that 
the  standard  deviation  of  the  final  product  was  equal  to  the  sum  of  the  standard 
deviations  produced  by  the  several  factors. 
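A small numerical illustration, with figures invented here purely for the purpose: suppose two independent factors produce standard deviations of 3 and 4 units respectively. Then

$$\sigma_{\text{total}}^2 = 3^2 + 4^2 = 25, \qquad \sigma_{\text{total}} = \sqrt{25} = 5$$

whereas adding the standard deviations directly would give 3 + 4 = 7, which overstates the variability of the final product.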

This  property  of  additiveness  of  variance  makes  possible  the  technique  known 
as  the  "Analysis  of  Variance,"  whereby  the  total  variance  of  a  process  can  be 
analysed  into  its  component  factors,  the  relative  importance  of  which  can  then  be 
assessed. 

The  Analysis  of  Variance  can  take  many  forms  according  to  the  structure  of 
the  process  being  analysed,  and  one  of  the  principal  difficulties  usually  found  in 
its  application  is  deciding  which  is  the  appropriate  form.  In  the  present  chapter 
only  the  two  simplest  forms  will  be  discussed,  and  in  the  succeeding  chapter  a 
more  general  treatment  for  dealing  with  the  more  complex  cases  frequently  met 
in  practice  will  be  attempted. 

(b)  Analysis  of  Variance  Between  and  Within  Batches 

Suppose that we have a two stage process. Stage A takes the raw material in batches of 300 lb. at a time and performs operation A on it. This operation is not perfectly reproducible and gives rise to a variance $\sigma_A^2$ in the final product. For stage B of the process, each 300 lb. batch is split into three 100 lb. units and operation B performed on these units. Operation B is also not perfectly reproducible and gives rise to a variance $\sigma_B^2$ in the final product. The problem consists of observing the total variance in the final product and analysing it into its two components $\sigma_A^2$ and $\sigma_B^2$. Having done this, we will then be able to decide which is the part of the process which requires the more careful control in order to reduce the total variability.

In evaluating $\sigma_A^2$ and $\sigma_B^2$ we proceed by first calculating $(n\sigma_A^2 + \sigma_B^2)$ and $\sigma_B^2$ (in the general case where there are n units per batch). The reason for this is that it makes the estimation of the significance of $\sigma_A^2$ practicable. The function $(n\sigma_A^2 + \sigma_B^2)$ has (m − 1) degrees of freedom, where m is the number of batches, and $\sigma_B^2$ has m(n − 1) degrees of freedom. For $\sigma_A^2$ to exist, therefore, $(n\sigma_A^2 + \sigma_B^2)$ must be greater than $\sigma_B^2$, and we can readily test this with the variance ratio test.

(1) Assuming the factors are acting independently.

In  Table  7.1,  below,  the  value  of  the  property  for  each  unit  has  been  set  down 
in  columns  of  3,  the  three  units  in  each  column  coming  from  a  single  batch. 

In order to reduce the numbers to small numbers (for convenience in arithmetical computation) a constant has been subtracted from all of them: it is obvious that this shift in level will in no way affect the relative variability.


Table 7.1

Batch No.     1     2     3     4     5     6     7     8     9    10    Grand Total
             -3    -4    -3    -1     4    -2     1     2    -1    -1
             -2    -3    -1     3     3     3     0     1    -1     1
             -3    -5    -4     2     6     1     1     1     2     1
Totals       -8   -12    -8     4    13     2     2     4     0     1        -2

The process of computation is as follows:

(1) Square the individuals and add, i.e.

$$(-3)^2 + (-4)^2 + (-3)^2 + \dots + 1^2 + 2^2 + 1^2 = 204$$

(2) Obtain the total for each column, square these totals, sum these squares, and divide this total by the number of individuals in each column, i.e.

$$\left((-8)^2 + (-12)^2 + (-8)^2 + \dots + 0^2 + 1^2\right)/3 = 160.67$$

(3) Obtain the grand total for all individuals, square this grand total, and divide by the grand total number of individuals, i.e.

$$(-3 - 4 - 3 - \dots + 1 + 2 + 1)^2/30 = 0.13$$
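The three terms can be obtained mechanically from the data of Table 7.1. In the minimal sketch below the variable names `term_1`, `term_2` and `term_3` are labels chosen here to match the numbered steps; they are not notation from the text.

```python
# Table 7.1: one inner list per batch (column), three units per batch.
batches = [[-3, -2, -3], [-4, -3, -5], [-3, -1, -4], [-1, 3, 2], [4, 3, 6],
           [-2, 3, 1], [1, 0, 1], [2, 1, 1], [-1, -1, 2], [-1, 1, 1]]

n = len(batches[0])  # individuals per column
values = [x for batch in batches for x in batch]

# (1) Sum of the squared individuals.
term_1 = sum(x * x for x in values)

# (2) Sum of the squared column totals, divided by the number per column.
term_2 = sum(sum(batch) ** 2 for batch in batches) / n

# (3) Square of the grand total, divided by the total number of individuals.
term_3 = sum(values) ** 2 / len(values)

print(term_1, round(term_2, 2), round(term_3, 2))
print("between columns:", round(term_2 - term_3, 2))
print("within columns: ", round(term_1 - term_2, 2))
```

The differences printed at the end are the Sums of Squares that enter the analysis of variance table.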

We then form a table of this analysis of variance:

Table 7.2

Source of Variance    Sum of Squares         Degrees of Freedom    Mean Squares    Components of Variance
Between columns       (2) - (3) = 160.54     m - 1 = 9             17.84           $n\sigma_A^2 + \sigma_B^2$
Within columns        (1) - (2) = 43.33      mn - m = 20           2.17            $\sigma_B^2$
Total                 (1) - (3) = 203.87     mn - 1 = 29

m = number of columns.
n = number of individuals in each column.
$\sigma_A^2$ = Variance due to differences between columns.
$\sigma_B^2$ = Variance within columns.
The three Sums of Squares are the differences between the appropriate terms (1), (2) and (3), calculated above. The degrees of freedom are derived as follows:

(a) Between Columns. There are m columns, hence the degrees of freedom for this term is (m − 1) (the degrees of freedom in general equal the number of ways the group concerned can be arbitrarily filled in if the total is determined, thus when (m − 1) have been filled in the last one is uniquely determined, i.e. we can only fill (m − 1) classes arbitrarily).

(b)  Within  Columns.  There  are  n  individuals  in  each  column  :  each  column 
therefore  contributes  (n  —  1)  degrees  of  freedom  to  this  term  :  there  are 
m  columns,  hence  the  total  degrees  of  freedom  is  m  (n  —  1)  =  mn  —  m. 

(c)  Total.  The  total  degrees  of  freedom  are  mn  —  1,  one  less  than  the  total 
number  of  observations. 

The  Mean  Squares  are  the  Sums  of  Squares  divided  by  the  degrees  of  freedom. 

The  word  "Squares"  in  the  column  headings  in  the  table  refers  to  Squares 
of  deviations  from  means,  and  so  the  Mean  Squares  terms  have  some  of  the 
characteristics  of  variances. 

The last column shows that the Between Column Mean Square estimates $(n\sigma_A^2 + \sigma_B^2)$ and the Within Column Mean Square estimates $\sigma_B^2$. Thus if $\sigma_A^2$ exists significantly, then the Between Column Mean Square must be significantly greater than the Within Column Mean Square.

Accordingly,  we  wish  to  test  whether  the  Between  Column  Mean  Square  is 
significantly  greater  than  the  Within  Column  Mean  Square.  This  can  be  done 
with  the  Fisher  variance  ratio  test,  discussed  earlier  in  Chapter  IV  (a). 

We merely calculate the ratio of the larger mean square to the smaller, here 17.84/2.17 = 8.22, and enter the table for F with $n_1$ = degrees of freedom of the larger variance (here $n_1 = 9$) and $n_2$ = degrees of freedom of the smaller variance (here $n_2 = 20$). If our value for F is greater than that given in the table, then the result is more significant than the level of the table. Here for example, for $n_1 = 8$ (the table does not give $n_1 = 9$), $n_2 = 20$, F has the values 2.45, 3.56 and 5.44 for the 5%, 1% and 0.1% levels of significance. Accordingly this particular result is more significant than 0.1%.

Having thus established that the Between Column Mean Square is significantly greater than the Within Column Mean Square, i.e. that the Between Column Variance does exist, we can proceed to calculate its value. We have

$$n\sigma_A^2 + \sigma_B^2 = 17.84$$
$$\sigma_B^2 = 2.17$$

where n = number of individuals in each column, here 3.

Thus

$$\sigma_A^2 = (17.84 - 2.17)/3 = 5.22$$
$$\sigma_B^2 = 2.17$$

The total variance observed in our original data is thus made up of these two components, $\sigma_A^2 = 5.22$ due to differences between columns (here columns = batches) and $\sigma_B^2 = 2.17$ due to variability within columns (here within batches, that is to say, between units for a given batch).
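Starting from the two Sums of Squares of Table 7.2, the mean squares, the variance ratio and the between-column component follow directly. The sketch below is illustrative only; the function name `variance_components` is an assumption made here, and the comparison of the ratio with the F table is left to the reader.

```python
def variance_components(ss_between, ss_within, m, n):
    """Mean squares, variance ratio and between-column component.

    m = number of columns, n = number of individuals in each column.
    """
    ms_between = ss_between / (m - 1)        # estimates n*var_between + var_within
    ms_within = ss_within / (m * (n - 1))    # estimates var_within
    f_ratio = ms_between / ms_within
    var_between = (ms_between - ms_within) / n
    return ms_between, ms_within, f_ratio, var_between


ms_b, ms_w, f_ratio, var_between = variance_components(160.54, 43.33, m=10, n=3)
print(f"mean squares: {ms_b:.2f} and {ms_w:.2f}")
print(f"variance ratio F = {f_ratio:.2f} with 9 and 20 degrees of freedom")
print(f"between-column component = {var_between:.2f}")
```

If the Between Column Mean Square happens to be smaller than the Within Column Mean Square, the subtraction gives a negative figure; this is to be read as showing no evidence of a between-column component, not as a negative variance.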

(c)  The  Investigation  of  Multi- Stage  Processes 

We  can  thus  see  that  the  larger  proportion  of  the  total  variability  in  the  above 
process  is  arising  at  stage  A,  and  only  a  minor  part  at  stage  B.  Thus  in  any 
attempt  to  standardise  the  process,  to  reduce  the  total  variability  in  the  final 
product,  almost  all  the  attention  should  be  directed  towards  stage  A. 

The total variance of the process as it stands at present is $\sigma_A^2 + \sigma_B^2 = 5.22 + 2.17 = 7.39$, i.e. its standard deviation is $\sqrt{7.39}$. Our estimate of the spread within which 95% of all individuals should lie is thus $\pm 1.96 \times \sqrt{7.39} = \pm 5.3$, or a total spread of 2 × 5.3 = 10.6. From inspection of the original data this is seen to be reasonable: out of the thirty observations the smallest is −5 and the largest is +6, a spread of 11.0.

Suppose we directed our efforts at standardising stage B, and were completely successful, and reduced $\sigma_B^2$ to zero. The total variance would now be $\sigma_A^2 = 5.22$, i.e. the standard deviation would be 2.29 and the 95% spread ±1.96 × 2.29 = ±4.5, or a total spread of 9.0. The total spread originally was 10.6, so the improvement has been comparatively slight.

If, on the other hand, we had directed our efforts at standardising stage A and were completely successful in reducing $\sigma_A^2$ to zero, the total variance would now be $\sigma_B^2 = 2.17$ or the standard deviation 1.47. The 95% spread would then be ±1.96 × 1.47 = ±2.9 and the total spread 5.8.
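The three cases can be compared side by side with a short calculation. In this sketch the function name `total_spread_95` is hypothetical; it simply adds the component variances supplied and returns twice 1.96 standard deviations.

```python
import math


def total_spread_95(*variances):
    """Total 95% spread (2 x 1.96 x standard deviation) for independent components."""
    return 2 * 1.96 * math.sqrt(sum(variances))


spread_present = total_spread_95(5.22, 2.17)   # both stages as they stand
spread_b_removed = total_spread_95(5.22)       # stage B perfectly standardised
spread_a_removed = total_spread_95(2.17)       # stage A perfectly standardised

print(f"present process:        {spread_present:.1f}")
print(f"stage B variance zero:  {spread_b_removed:.1f}")
print(f"stage A variance zero:  {spread_a_removed:.1f}")
```

Because the variances add and the spread depends on the square root of their sum, removing the smaller component changes the spread very little, while removing the larger one nearly halves it. Small differences in the last figure from the values quoted in the text arise from rounding at intermediate steps.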

The  improvement  effected  by  standardising  stage  A  is  thus  very  much  more 
substantial  than  that  achieved  by  standardising  stage  B.  It  is  clear  that  effort 
spent  in  investigating  stage  B  would  have  been  largely  wasted.  Before  carrying 
out  a  detailed  physico-chemical  investigation  on  a  multi-stage  process,  it  is  clearly 
desirable  to  find  out  with  such  an  analysis  of  variance  which  are  the  stages  giving 
rise to the variability, and concentrate upon them. The advantage of this procedure is that the stages subsequently ignored may have contained a large number of variables, each of which would have required investigation. In carrying out these investigations, we could not be confident that we were including the significant variable, and the failure to obtain a significant correlation might be due to the investigation of the wrong variables rather than to the fact that there was no significant variable for this part of the process.

The  analysis  of  variance  proceeds  by  concentrating  initially  upon  the  structure 
of  the  process,  and  only  subsequently  investigating  individual  variables. 

(d)  Analysis  of  Variance  of  Columns  of  Unequal  Size 

It  will  have  been  realized  that  the  analysis  of  variance  between  and  within 
columns  is  in  effect  an  extension  of  the  Student  t  test :  instead  of  comparing 
two  means  we  are  comparing  m  means,  where  m  is  the  number  of  columns.  It 
might  be  thought  that  we  might  make  a  series  of  t  tests  for  the  column  means 
taken  two  at  a  time.  This  would  involve  a  lot  of  labour,  and  in  addition  involves 
us  in  certain  complications.  Thus  if  we  had  20  comparisons,  and  we  are  working 
to  the  5%  level  of  significance,  then  on  the  average  even  if  the  columns  were 
drawn  from  the  same  population  we  would  expect  one  of  them  to  give  us  a  value 
of  Student's  t  as  large  as  the  5%  value.  The  use  of  the  analysis  of  variance 
evades  these  difficulties. 

It sometimes happens, however, that the number of individuals in each column is not the same, in which case the treatment given in Section (b) will not be directly applicable.

Consider  the  data  in  the  table  below.  It  represents  the  throughputs  obtained 
from  units  of  plant  before  failure  through  corrosion.  The  units  are  categorised 
according  to  the  foundry  manufacturing  them. 


Foundry    Throughput obtained                        Total    Mean
A          84, 60, 40, 47, 34                          265     53.0
B          67, 92, 95, 40, 98, 60, 59, 108, 86         705     78.3
C          46, 93, 100                                 239     79.7

Prima  facie  it  might  be  concluded  that  the  pots  from  Foundry  A  are  giving 
lower  throughputs  on  the  average  than  those  from  Foundries  B  and  C. 
The analysis proceeds by obtaining the following terms:

(1) Square the individuals and add, i.e.

$$84^2 + 60^2 + 40^2 + \dots + 93^2 + 100^2 = 95,709$$

(2) Square each row total, divide by the number of individuals in that row, and sum these terms, i.e.

$$265^2/5 + 705^2/9 + 239^2/3 = 88,310$$

(3) Square the grand total and divide by the total number of individuals, i.e.

$$(265 + 705 + 239)^2/17 = 85,981$$

The analysis of variance is then as below:

Source of Variance    Sums of Squares        Degrees of Freedom    Mean Squares
Between Foundries     (2) - (3) = 2329       2                     1164
Within Foundries      (1) - (2) = 7399       14                    528.5
Total                 (1) - (3) = 9728       16

The degrees of freedom are derived as follows:

(a) Between Foundries: one less than the number of foundries.

(b) Within Foundries: each foundry contributes one less than the number of observations on that foundry, viz., 4 + 8 + 2 = 14.

(c) Total: one less than the total number of observations.

Testing the Between Foundries Mean Square in the above analysis, it gives a variance ratio of 1164/528.5 = 2.20 for degrees of freedom $n_1 = 2$, $n_2 = 14$. This falls well short of the 5% level of significance, so there is insufficient justification for regarding the Foundry A pots as giving smaller average throughputs than those from the other foundries.
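The same analysis for groups of unequal size can be sketched in Python. The function name `one_way_anova` and the dictionary `foundries` are names introduced here for illustration; the only change from the equal-size case is that each group total is squared and divided by its own number of observations.

```python
def one_way_anova(groups):
    """Between/within analysis of variance for groups of possibly unequal size.

    Returns (ms_between, ms_within, f_ratio, dof_between, dof_within).
    """
    values = [x for group in groups for x in group]
    n_total = len(values)

    term_1 = sum(x * x for x in values)                           # (1)
    term_2 = sum(sum(group) ** 2 / len(group) for group in groups)  # (2)
    term_3 = sum(values) ** 2 / n_total                           # (3)

    dof_between = len(groups) - 1
    dof_within = n_total - len(groups)
    ms_between = (term_2 - term_3) / dof_between
    ms_within = (term_1 - term_2) / dof_within
    return ms_between, ms_within, ms_between / ms_within, dof_between, dof_within


foundries = {
    "A": [84, 60, 40, 47, 34],
    "B": [67, 92, 95, 40, 98, 60, 59, 108, 86],
    "C": [46, 93, 100],
}

ms_between, ms_within, f_ratio, dof_b, dof_w = one_way_anova(list(foundries.values()))
print(f"Between Foundries mean square = {ms_between:.1f} ({dof_b} d.f.)")
print(f"Within Foundries mean square  = {ms_within:.1f} ({dof_w} d.f.)")
print(f"variance ratio = {f_ratio:.2f}")
```

The variance ratio must still be referred to a table of F for 2 and 14 degrees of freedom to judge its significance.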

In the present instance we are more interested in the actual averages for the three foundries, and are using the analysis of variance to test the significance of the apparent difference between the averages. In the more general case of the analysis between and within batches, as discussed in the two previous sections, we probably wish to calculate the two components of variance. Using the notation of Section (b), with $\sigma_A^2$ the variance between batches (here between foundries) and $\sigma_B^2$ the variance within batches (here within foundries), the Within Foundries Mean Square as before estimates the Within Foundries variance, i.e. $\sigma_B^2 = 528.5$. The calculation of the Between Foundries variance is more complicated when the batch (column) sizes are unequal.

If N is the total number of observations, and $n_i$ is the number in the ith batch for i = 1 to k, there being k batches, then the Between Foundries (Between Batches or Columns) Mean Square estimates

$$\frac{N^2 - \sum n_i^2}{N(k - 1)}\,\sigma_A^2 + \sigma_B^2$$

It will be noted that in the case where all the batches (columns) are of equal size, i.e. all the $n_i$ are equal to n, this reduces to

$$\frac{(nk)^2 - kn^2}{nk(k - 1)}\,\sigma_A^2 + \sigma_B^2 = n\sigma_A^2 + \sigma_B^2$$

as quoted in Section (b) of this chapter.

In our present example our $n_i$ are respectively 5, 9, 3, N = 5 + 9 + 3 = 17, k = 3, and thus we get

$$1164.5 = \frac{17^2 - (5^2 + 9^2 + 3^2)}{17(3 - 1)}\,\sigma_A^2 + \sigma_B^2 = 5.118\,\sigma_A^2 + \sigma_B^2$$

Subtracting

$$528.5 = \sigma_B^2$$

we obtain

$$636.0 = 5.118\,\sigma_A^2$$

or $\sigma_A^2 = 124.3$.
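The coefficient multiplying the between-batch variance, and the component itself, can be checked with a short calculation. The function name `between_component` is hypothetical; the mean squares are those of the foundry example, and the equal-size check uses the ten batches of three units from Section (b).

```python
def between_component(ms_between, ms_within, sizes):
    """Coefficient (N^2 - sum n_i^2) / (N (k - 1)) and the between-batch component."""
    n_total = sum(sizes)   # N
    k = len(sizes)         # number of batches
    coefficient = (n_total ** 2 - sum(n * n for n in sizes)) / (n_total * (k - 1))
    component = (ms_between - ms_within) / coefficient
    return coefficient, component


coefficient, component = between_component(1164.5, 528.5, [5, 9, 3])
print(f"coefficient = {coefficient:.3f}")
print(f"between-foundries component = {component:.1f}")

# With equal batch sizes the coefficient reduces to n, the number per batch.
equal_coefficient, _ = between_component(17.84, 2.17, [3] * 10)
print(f"coefficient for ten batches of three = {equal_coefficient:.1f}")
```

The coefficient always lies below the largest batch size and equals the common batch size when all batches are alike, which is a useful check on the arithmetic.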
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It  might  be  re-emphasised  that  the  calculation  of  the  component  of  variance 
had  been  given  here  only  to  illustrate  the  method  .of  calculation.  In  practice 
with  this  particular  problem  it  would  not  be  done  because  firstly  we  would  be  more 
interested  in  the  averages,  secondly  the  effect  is  so  non-significant  that  its  calcula- 
tion would  be  meaningless,  the  component  being  indistinguishable  on  the  present 
data  from  zero.  It  further  would  not  be  desirable  practice  to  estimate  the  variance 
between  batches  (column)  from  only  three  batches  :  a  variance  so  estimated  would 
have  a  very  large  error.  Finally,  this  is  a  case  where  it  might  be  held  that  it 
was  unreasonable  to  suppose  that  there  was  an  infinite  population  of  foundries 
from  v/hich  these  three  are  a  sample  :  this  point  is  commented  on  further  in 
Chapter  XII  (b). 
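
The calculation for unequal batch sizes can be sketched in a few lines of Python. This is an illustration only: the function name and argument names are our own, and the figures fed to it are the mean squares and batch sizes quoted in this section.

```python
def between_batch_component(batch_sizes, ms_between, ms_within):
    """Return the coefficient of the between-batch variance in the Between
    Batches Mean Square, and the resulting estimate of that variance."""
    N = sum(batch_sizes)   # total number of observations
    k = len(batch_sizes)   # number of batches
    coeff = (N**2 - sum(n**2 for n in batch_sizes)) / (N * (k - 1))
    return coeff, (ms_between - ms_within) / coeff


# Foundry example: batch sizes 5, 9, 3; mean squares 1164.5 and 528.5.
coeff, var_between = between_batch_component([5, 9, 3], 1164.5, 528.5)
print(round(coeff, 3), round(var_between, 1))

# With equal batch sizes the coefficient reduces to the common batch size n.
equal_coeff, _ = between_batch_component([3] * 10, 1.0, 1.0)
print(equal_coeff)
```

The second call merely checks the reduction to n for batches of equal size; the mean squares passed to it are dummy values.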

(e)  Analysis  of  Variance  into  Components  due  to  Rows,  Columns, 
and  a  Residual 

It  frequently  happens  that  not  only  can  the  individuals  (units)  be  categorised 
according  to  batches  (columns)  but  also  simultaneously  according  to  rows.  Thus 
in  the  previous  example,  the  first  individual  in  every  column  (batch)  was  processed 
on  a  particular  machine,  all  the  second  individuals  in  every  column  on  another 
machine, and all the third individuals on a third machine. Each row thus corresponds to one particular machine. There may be a systematic difference between
these  machines  ;  we  wish  to  test  whether  such  an  effect  exists,  and  if  it  does  exist 
estimate  its  contribution  to  the  total  variance  of  the  final  product. 

Formally  what  we  have  to  deal  with  here  is  a  twofold  classification;  every 
individual  belongs  to  a  particular  row  and  to  a  particular  column. 

Other examples of twofold categorisation are the output from a shop according to machine and according to the shift operating the machines. Or, in an experiment on conditions of nitration of cellulose, we could have $n_1$ different acid concentrations, corresponding to rows, and $n_2$ different acid temperatures, corresponding to columns, where $n_1$ and $n_2$ could be any convenient numbers.

To  revert  to  our  example  of  batches  split  into  three  units,  we  proceed  by 
calculating  the  following  terms : — 

(1)  Square  the  individuals  and  add  (we  have  done  this  already — term  (1)  of 
the  previous  analysis). 

(2)  Obtain  the  total  for  each  row,  square  these  totals,  sum  these  squares, 
and  divide  this  total  by  the  number  of  individuals  in  each  row  (this  is 
the  new  term,  here  equal  to  8.40). 

(3)  Obtain  the  total  for  each  column,  square  these  totals,  sum  these  squares, 
and  divide  this  total  by  the  number  of  individuals  in  each  column  (term 
(2)  of  the  previous  analysis). 

(4)  Obtain  the  grand  total  for  all  individuals,  square  this  grand  total  and 
divide  by  the  grand  total  number  of  individuals  (term  (3)  of  the  previous 
analysis). 

We  then  form  a  table  of  this  analysis  of  variance. 


Source of          Sum of Squares                Degrees of                     Mean      Components
Variance                                         Freedom                        Square    of Variance

Between Rows       (2) - (4) = 8.40 - 0.13       $n_1 - 1 = 2$                  4.13      $n_2\sigma_1^2 + \sigma_0^2$
Between Columns    (3) - (4) = 160.67 - 0.13     $n_2 - 1 = 9$                  17.84     $n_1\sigma_2^2 + \sigma_0^2$
Residual           (1) + (4) - (2) - (3)         $(n_1 - 1)(n_2 - 1) = 18$      1.94      $\sigma_0^2$

Total              (1) - (4) = 203.87            $n_1 n_2 - 1 = 29$

$n_1$ = number of rows = 3

$n_2$ = number of columns = 10

$\sigma_1^2$ = variance due to differences between rows

$\sigma_2^2$ = variance due to differences between columns

$\sigma_0^2$ = residual variance

The  Residual  Sum  of  Squares  can  be  obtained  as  that  combination  of  terms 
indicated,  but  in  practice  it  is  easier  to  calculate  the  Row  and  Column  Sums  of 
Squares,  and  find  the  Residual  as  the  difference  between  their  sum  and  the  Total 
Sum  of  Squares. 

To test for the significance of the Between Rows Mean Square we compare it with the Residual using the variance ratio test. Here the variance ratio is 4.13/1.94 = 2.12 for degrees of freedom $n_1 = 2$, $n_2 = 18$. Referring to the tables in the Appendix we see that this is not significant. Thus our hypothesis, that there was no systematic difference between the rows (machines), is in reasonable accord with this data. If the Between Rows Mean Square had proved significant, we could have proceeded to estimate the magnitude of $\sigma_1^2$, the variance due to differences between rows (machines), from the pair of equations $n_2\sigma_1^2 + \sigma_0^2 = 4.13$, $\sigma_0^2 = 1.94$, whence $\sigma_1^2 = (4.13 - 1.94)/n_2$, the $n_2$ in these equations being the number of columns, 10.
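
The four terms and the resulting table can be checked with a short Python sketch. The observations used are those of Table 8.1 in the next chapter, which the text states to be the data of Table 7.1; the variable names are our own and the layout (machines as rows, batches as columns) is as described in this section.

```python
# Rows are machines, columns are batches (data of Table 8.1).
data = [
    [-3, -4, -3, -1, 4, -2, 1, 2, -1, -1],
    [-2, -3, -1,  3, 3,  3, 0, 1, -1,  1],
    [-3, -5, -4,  2, 6,  1, 1, 1,  2,  1],
]
n_rows, n_cols = len(data), len(data[0])

term1 = sum(x * x for row in data for x in row)                # (1) squares of individuals
term2 = sum(sum(row) ** 2 for row in data) / n_cols            # (2) from row totals
term3 = sum(sum(col) ** 2 for col in zip(*data)) / n_rows      # (3) from column totals
term4 = sum(sum(row) for row in data) ** 2 / (n_rows * n_cols) # (4) from grand total

ss_rows = term2 - term4
ss_cols = term3 - term4
ss_total = term1 - term4
ss_resid = ss_total - ss_rows - ss_cols  # residual found by difference

ms_rows = ss_rows / (n_rows - 1)
ms_cols = ss_cols / (n_cols - 1)
ms_resid = ss_resid / ((n_rows - 1) * (n_cols - 1))

f_rows = ms_rows / ms_resid  # variance ratio for rows against residual
print(round(ms_rows, 2), round(ms_cols, 2), round(ms_resid, 2), round(f_rows, 2))
```

Small differences in the last figure from the tabulated values arise only from rounding.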


CHAPTER  VIII 
THE   QUALITY  CONTROL  CHART 

(a)  Introduction 

The  Quality  Control  Chart  has  been  much  used  in  recent  years  as  a  means 
of  control  of  quality  of  production,  particularly  in  light  engineering  processes  such 
as  the  use  of  automatic  lathes,  presses,  etc.  It  has  not  been  used  much  hitherto 
in  the  chemical  industry  though  certain  types  of  process  seem  suitable  for  its 
application. Here it will only be discussed with reference to investigational work.(1)

The quality control chart is designed to detect the presence of "assignable causes of variation" in a sequence of individuals. In effect it takes "rational sub-groups" (that is, a group of individuals more likely to be similar than the general population, e.g. individuals from a particular machine, operator, day, consignment of raw material, etc.), obtains from these rational sub-groups a measure of the population variance, and then checks that the sub-group means do not vary more than they should do on the assumption that all individuals are drawn from the same population, i.e. that there is no significant difference between the sub-group means. It will be apparent that the result of this test is the same as the analysis of variance for between and within batches.

(1) The literature of industrial quality control is now very great. The standard work is W. A. Shewhart's "Economic Control of Quality of Manufactured Product" (Macmillan: 1931). Short accounts are British Standard 600R, "Quality Control Charts," B. P. Dudding and W. J. Jennett, 1942; British Standard 1008, "Quality Control"; "A First Guide to Quality Control for Engineers," E. H. Sealy, H.M.S.O., 1946. A fuller account is L. E. Simon's "An Engineer's Manual of Statistical Methods" (Chapman & Hall, 1941).

A recent British Standard, 1313, "Fraction Defective Charts for Quality Control," discusses the type of chart for use when the observed characteristic is not a measurement on a continuous scale but is instead a simple classification into "Yes or No, Defective or Not Defective."

An alternative approach, more particularly from the point of view of sampling inspection, is "Quality Through Statistics," A. S. Wharton (Philips Lamps Ltd., Second Edition 1946).

A comprehensive discussion of quality control is "Statistical Quality Control," E. L. Grant (McGraw-Hill, 1946).

A discussion of quality control applied to routine chemical analysis is by H. E. MacColl: Chemistry and Industry, December 9th, 1944, page 418.

The quality control chart as used in practice almost invariably derives its estimate of variance within the sub-groups not from the variance but from the mean range within the sub-groups, there being available factors for converting mean range to standard deviation (see Section (e)). Having obtained an estimate of $\sigma$ (the within sub-group standard deviation) we can assert that the means of samples of n will be distributed with standard deviation $\sigma/\sqrt{n}$. Thus 95% will be within $1.96\sigma/\sqrt{n}$ on either side of X (the grand mean, assumed equal to the population mean). This is the same as 1 in 40 below $X - 1.96\sigma/\sqrt{n}$ and 1 in 40 above $X + 1.96\sigma/\sqrt{n}$. The quality control chart normally uses these limits as "inner" limits and also "outer" limits at $X \pm 3.09\sigma/\sqrt{n}$, these giving 99.8% within the limits, or 1 in 1000 below the lower and 1 in 1000 above the upper limit.
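
The connection between the multipliers 1.96 and 3.09 and the quoted frequencies can be verified from the normal distribution. The sketch below uses Python's standard library; the function names and the example figures passed to `control_limits` are our own and purely illustrative.

```python
from math import sqrt
from statistics import NormalDist


def tail_probability(multiplier):
    """Probability that a sample mean falls below X - multiplier * sigma / sqrt(n),
    assuming the means are normally distributed."""
    return NormalDist().cdf(-multiplier)


def control_limits(grand_mean, sigma, n, multiplier):
    """Lower and upper control lines for means of samples of size n."""
    half_width = multiplier * sigma / sqrt(n)
    return grand_mean - half_width, grand_mean + half_width


inner_tail = tail_probability(1.96)  # close to 1 in 40
outer_tail = tail_probability(3.09)  # close to 1 in 1000
print(inner_tail, outer_tail)

# Hypothetical figures: grand mean 10, sigma 2, samples of 4.
print(control_limits(10.0, 2.0, 4, 1.96))
```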

If the mean of a particular sample falls outside the limits, it is probable that the underlying assumptions are not being satisfied. These assumptions are, of course, that the sample is drawn from a population of standard deviation $\sigma$ and grand mean X. The former point is checked by a control chart on the sub-group ranges (see next section), and therefore it is probable that the sample's mean does differ significantly from the value X.


(b)  Within  Batch  Variability:    the  Control  Chart  for  Range 

Consider  the  data  used  previously  in  Table  7.1,  Chapter  VII  (b).  We  wish 
now  to  test  the  hypothesis  that  the  variabilities  within  each  column  (batch)  are 
drawn  from  a  homogeneous  population  :  in  other  words,  that  batches  do  not 
differ  in  their  internal  variability. 

Table 8.1

Batch                       1      2      3      4      5      6      7      8      9     10

Data                       -3     -4     -3     -1      4     -2      1      2     -1     -1
                           -2     -3     -1      3      3      3      0      1     -1      1
                           -3     -5     -4      2      6      1      1      1      2      1

Mean $\bar{x}$          -2.67  -4.00  -2.67   1.33   4.33   0.67   0.67   1.33      0   0.33
Range w                     1      2      3      4      3      5      1      1      3      2
$\sum x$                   -8    -12     -8      4     13      2      2      4      0      1
$\sum x^2$                 22     50     26     14     61     14      2      6      6      3
$\sum (x - \bar{x})^2$   0.67   2.00   4.67   8.67   4.67  12.67   0.67   0.67   6.00   2.67
Variance $\sigma_i^2$    0.33   1.00   2.33   4.33   2.33   6.33   0.33   0.33   3.00   1.33

For this purpose we obtain the range w for each batch and plot these consecutively on a chart as in Figure 6. We then calculate the mean range $\bar{w}$, here equal to 2.50, and plot on the chart an inner control line at $D'_{0.975}\,\bar{w} = 2.17 \times 2.50 = 5.4$ and an outer control line at $D'_{0.999}\,\bar{w} = 2.98 \times 2.50 = 7.5$. These D' factors are obtained from Table V in the Appendix for sample size 3. If the batches are similar in their internal variability, then on the average not more than 1 out of 40 points will lie outside the inner control line, and only 1 out of 1000 points outside the outer control line. The occurrence of two points outside the inner control line within two or three points, or of a single point outside the outer control line, therefore, is strong evidence that we cannot regard the batches as having the same internal variability.

Figure 6

In the present instance it is clear from Figure 6 that there are no points outside either of the control lines, so there is no reason to suppose that the batches differ in their internal variability.
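
The construction of the range chart can be sketched as follows. The data are those of Table 8.1 and the two D' factors are the values quoted in the text for sample size 3; the variable names are our own.

```python
# Ten batches (columns of Table 8.1), three individuals each.
batches = [
    (-3, -2, -3), (-4, -3, -5), (-3, -1, -4), (-1, 3, 2), (4, 3, 6),
    (-2, 3, 1), (1, 0, 1), (2, 1, 1), (-1, -1, 2), (-1, 1, 1),
]

# Factors for sample size 3, as quoted in the text from Table V of the Appendix.
D_INNER, D_OUTER = 2.17, 2.98

ranges = [max(b) - min(b) for b in batches]
mean_range = sum(ranges) / len(ranges)

inner_line = D_INNER * mean_range
outer_line = D_OUTER * mean_range

# Batch numbers (counting from 1) whose range lies beyond each control line.
beyond_inner = [i + 1 for i, w in enumerate(ranges) if w > inner_line]
beyond_outer = [i + 1 for i, w in enumerate(ranges) if w > outer_line]

print(ranges, mean_range, inner_line, outer_line, beyond_inner, beyond_outer)
```

Both lists of offending batches come out empty, in agreement with the reading of Figure 6.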

(c)  The  Control  Chart  for  Ranges  compared  with  Bartlett's  Test 

To digress from the present theme, it will be noted that the control chart for range is testing exactly the same hypothesis as Bartlett's test (see Chapter IV (c)). To apply the latter, we first calculate the variance of each batch, $\sigma_i^2$, through the steps indicated, namely by obtaining $\sum x$, $\sum x^2$, and thus $\sum (x - \bar{x})^2$. The sum of the natural logarithms of the within batch variances is $\sum \log_e(\sigma_i^2) = 3.06039$. The degrees of freedom n of the individual variances are 2. The mean of the batch variances, $S^2$, is 2.164, and k, the number of variances being compared, is 10.

Whence we have

$$B = k\,n\,\log_e S^2 - n \sum \log_e(\sigma_i^2) = 10 \times 2 \times \log_e 2.164 - 2 \times 3.06039 = 9.31862$$

$$C = 1 + \frac{k+1}{3nk} = 1 + \frac{10+1}{3 \times 2 \times 10} = 1.183$$

Thus B/C = 9.319/1.183 = 7.87 and is distributed as $\chi^2$ with (k - 1) = 9 degrees of freedom. This is much less significant than the 10% level of significance, so there is no reason to suppose that the batches differ from each other in their internal variability.

The  use  of  Bartlett's  Test  thus  leads  to  the  same  conclusion  as  the  control 
chart  for  ranges.  It  requires  more  calculation,  but  on  the  other  hand  has  the 
advantages  of  being  more  sensitive  and  precise. 
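
A sketch of the Bartlett calculation for the case used here, where every variance has the same degrees of freedom, is given below. The function names are our own. Because the code works with unrounded batch variances, whereas the text uses variances rounded to two decimal places, the ratio B/C comes out at about 7.8 rather than exactly 7.87; the conclusion is unaffected.

```python
import math


def sample_variance(xs):
    """Sum of squared deviations from the mean, divided by (number - 1)."""
    mean = sum(xs) / len(xs)
    return sum((x - mean) ** 2 for x in xs) / (len(xs) - 1)


def bartlett_equal_df(batches):
    """Bartlett's B, the correction C, and B/C for k batches of equal size."""
    k = len(batches)
    n = len(batches[0]) - 1  # degrees of freedom of each variance
    variances = [sample_variance(b) for b in batches]
    mean_var = sum(variances) / k
    B = k * n * math.log(mean_var) - n * sum(math.log(v) for v in variances)
    C = 1 + (k + 1) / (3 * n * k)
    return B, C, B / C


batches = [
    (-3, -2, -3), (-4, -3, -5), (-3, -1, -4), (-1, 3, 2), (4, 3, 6),
    (-2, 3, 1), (1, 0, 1), (2, 1, 1), (-1, -1, 2), (-1, 1, 1),
]
B, C, ratio = bartlett_equal_df(batches)
print(B, C, ratio)  # ratio is referred to chi-squared with k - 1 = 9 degrees of freedom
```

Note that a batch with zero variance would make the logarithm undefined; none occurs in these data.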

(d)  Between  Batch  Variability  :    the  Control  Chart  for  Means 

We  wish  to  test  whether  the  batches  differ  significantly  in  their  means,  having 
regard  to  the  observed  amount  of  variability  of  the  individuals. 

We obtain the mean for each batch (first row in lower part of Table 8.1) and plot this consecutively as in Figure 7. We calculate the grand mean X as -0.07, and draw in inner control lines at $X \pm A'_{0.025}\,\bar{w} = -0.07 \pm 0.668 \times 2.50$, or 1.60 and -1.74, and outer control lines at $X \pm A'_{0.001}\,\bar{w} = -0.07 \pm 1.054 \times 2.50$, or 2.57 and -2.70. Thus, if the Between Batch variability is no greater than that due to the Within Batch variability, 1 point out of 20 will lie outside the inner control lines and 2 out of 1000 outside the outer control lines. Here 4 out of 10 lie outside the inner control lines and 2 out of 10 outside the outer control lines. It is thus evident that the hypothesis of no between batch variability is untenable.

Figure 7. Control chart for means.

This  is  in  complete  agreement  with  the  conclusion  reached  using  the  analysis 
of  variance  in  Chapter  VII  (b).  With  the  control  chart  the  arithmetical  labour 
is  much  less,  but  the  analysis  of  variance  has  the  advantage  of  giving  numerical 
estimates  of  the  relative  importance  of  the  Between  Batch  and  Within  Batch 
variabilities,  and  of  the  exact  significance  of  the  existence  of  the  Between  Batch 
variability.  For  detailed  and  exact  work,  therefore,  the  analysis  of  variance  is  to 
be  preferred,  but  for  preliminary  and  exploratory  investigations  the  control  chart 
is  very  useful. 

An  important  proviso  in  the  use  of  control  charts  is  that  the  control  lines 
should  be  based  upon  at  least  10  and,  preferably,  20  batches  :  otherwise  the 
estimate  of  the  mean  range  w  is  not  very  accurate. 
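
The control chart for means described in this section can be sketched in the same way as the chart for ranges. The data are those of Table 8.1 and the A' factors are the values quoted in the text for sample size 3; the variable names are our own.

```python
batches = [
    (-3, -2, -3), (-4, -3, -5), (-3, -1, -4), (-1, 3, 2), (4, 3, 6),
    (-2, 3, 1), (1, 0, 1), (2, 1, 1), (-1, -1, 2), (-1, 1, 1),
]

# Factors for sample size 3, as quoted in the text from Table V of the Appendix.
A_INNER, A_OUTER = 0.668, 1.054

means = [sum(b) / len(b) for b in batches]
grand_mean = sum(sum(b) for b in batches) / sum(len(b) for b in batches)
mean_range = sum(max(b) - min(b) for b in batches) / len(batches)

inner = (grand_mean - A_INNER * mean_range, grand_mean + A_INNER * mean_range)
outer = (grand_mean - A_OUTER * mean_range, grand_mean + A_OUTER * mean_range)

# Count the batch means falling outside each pair of control lines.
outside_inner = sum(1 for m in means if m < inner[0] or m > inner[1])
outside_outer = sum(1 for m in means if m < outer[0] or m > outer[1])

print(round(grand_mean, 2), inner, outer, outside_inner, outside_outer)
```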

(e)  The  Conversion  of  Range  to  Standard  Deviation 

It  frequently  happens  that  we  have  a  series  of  samples  of  n  individuals  drawn 
from  a  population  and  we  wish  to  know  their  standard  deviation.  We  can  of 
course  calculate  the  variance  for  each  sample  and  take  the  average  (since  the 
samples  are  all  of  n  individuals  the  simple  average  variance  will  be  the  same  as 
that  obtained  by  pooling  the  sums  of  squares  and  the  degrees  of  freedom).  The 
arithmetic required is much less, however, if we obtain the range for each sample, calculate the average range, and divide this average range by the factor $d_n$ given in the last column of Table V in the Appendix, selecting that $d_n$ corresponding to the actual size of sample. The resulting figure is the standard deviation. The standard deviation so calculated is an approximation and can only be used when the number of samples taken is reasonably large, say not less than 20. The estimate is then assumed to have nearly infinite degrees of freedom.

It will be seen that except when the sample size is 2 we are sacrificing some accuracy in our estimate of $\sigma$, the compensating gain being, of course, the ease of computation. This loss of information is not very serious, however, being less than 20% for samples of size 10 and less for smaller samples. Under the conditions in which we use this method this does not really matter, since we have such a large sample that our estimate will be accurate enough for almost all purposes.

The reader will note that it is this factor $d_n$ which is the basis of the factors $A'_{0.025}$ and $A'_{0.001}$ used for the formulation of the control lines for means. Thus it is clear that, for a sample of size n, control lines for 1 in 40 limits (1 in 40 above the upper limit and 1 in 40 below the lower limit, i.e. 1 in 20 outside the limits) should be at $1.96\sigma/\sqrt{n}$, where $\sigma$ is the standard deviation, 1.96 is the value of t for infinite degrees of freedom (it is assumed that $\sigma$ is known exactly as we have supposedly taken a large number, preferably greater than 20, of samples) and the 5% probability level, and n is the size of sample. If for example n is 4, and $\bar{w}$ is the mean range, then the limits should be at $1.96(\bar{w}/2.059)/\sqrt{4} = 0.476\,\bar{w}$, where 2.059 is the value of $d_n$ for n = 4. It will be seen that this is the value for $A'_{0.025}$ given in Table V of the Appendix.
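
The relation between $d_n$ and the A' factors can be checked numerically. In the sketch below the value 2.059 for n = 4 is the one quoted in the text; the value 1.693 for n = 3 is the commonly tabulated figure and is an assumption here, since the text does not quote it. The function names are our own.

```python
import math

# d_n by sample size. 2.059 is quoted in the text; 1.693 is the commonly
# tabulated value for n = 3 and is assumed here.
D_N = {3: 1.693, 4: 2.059}


def sigma_from_mean_range(mean_range, n):
    """Estimate the standard deviation from the mean range of samples of size n."""
    return mean_range / D_N[n]


def a_factor(multiplier, n):
    """Factor which, multiplied by the mean range, gives the half-width of the
    control lines for means: multiplier / (d_n * sqrt(n))."""
    return multiplier / (D_N[n] * math.sqrt(n))


print(round(a_factor(1.96, 4), 3))  # compare with 0.476 in the text
print(round(a_factor(1.96, 3), 3))  # compare with 0.668 used in Section (d)
print(round(a_factor(3.09, 3), 3))  # compare with 1.054 used in Section (d)
print(round(sigma_from_mean_range(2.50, 3), 2))
```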


CHAPTER  IX 
THE  RELATION  BETWEEN  TWO   VARIABLES 

(a) Introduction (1)

A  problem  that  arises  frequently  is  to  determine  whether  an  apparent  relation 
between  two  variables  is  significant,  and  having  shown  it  to  be  significant,  to 
determine  the  best  form  of  representation. 

The statistical methods available for dealing with this problem are not very satisfactory. The treatment in general is to test whether the data can be represented by the equation for the simple straight line, y = a + bx, whether the deviations from this straight line are significant, and if so whether these deviations can be represented by the equation $y = a + bx + cx^2$ or $y = a + bx + cx^2 + dx^3$, etc. In practice, the computational labour is excessively heavy for any of the steps but the first, namely, seeing whether the data can be approximately represented by a straight line of the form y = a + bx.(2)

It is clear where the unsatisfactory nature of this treatment lies: there is no a priori reason why a complex system should give a straight line relationship between two of its variables, or even the more complex relationship $y = a + bx + cx^2$, etc. The case where the treatment is most appropriate is where the data
when  plotted  on  a  graph  has  the  appearance  of  an  elliptical  cloud,  and  we  are 
dubious  whether  this  ellipticity  is  genuinely  due  to  a  real  relationship  or  an  accident 
of  sampling.  Alternatively  there  might  be  little  doubt  about  the  genuineness  of 
the  existence  of  a  relationship  but  we  want  to  have  the  best  estimate  of  the 
relationship. 

If  the  data  when  plotted  on  a  graph  fall  on  a  smooth  curve  of  complex  shape 
then  the  present  techniques  are  not  applicable.  In  practice,  therefore,  we  should 
only  use  the  present  statistical  treatment  when  the  pairs  of  observations  on  the  two 
variables  plotted  on  a  graph  give  something  that  looks  approximately  a  straight  line. 

(1) A comprehensive survey of the field is given by W. E. Deming: "Statistical Adjustment of Data" (John Wiley: Chapman & Hall), 1943.

(2) An exception is where the independent variable is changing in equal stages: cf. Fisher's "Statistical Methods for Research Workers," Sections 27 and 28. Also, G. Egloff and R. C. Kuder, Journal of Physical Chemistry, Vol. 46, page 926, 1942; and L. H. C. Tippett: "Methods of Statistics" (Williams & Norgate), Section 9.3.

(b) Transformations

We may have a priori reasons, from the theory of the process, to suspect that the relationship should have the form

$$Y = AB^x$$

which is equivalent to the exponential form

$$Y = Ae^{Cx}$$

We can transform this to a linear function by taking logarithms,

$$\log Y = \log A + x \log B$$

or putting log Y = y, log A = a, log B = b, we now have the relation in the convenient linear form

$$y = a + bx$$

Also, if we suspect that the relationship should have the form

$$Y = AX^B$$

where B can have any value, e.g. 2, when the relation would be

$$Y = AX^2$$

we can take logarithms

$$\log Y = \log A + B \log X$$

or putting log Y = y, log A = a, log X = x, we again have the relation in the convenient linear form

$$y = a + Bx$$

In the fitting of the best straight line to data, the definition of the best line is that line which makes a minimum the sum of squares of the deviations from the line of the measurements of y: the independent variable x is assumed to be free of error. It will be apparent that when this process is applied to a pair of variables transformed in the manner indicated above what is being minimised is not the sum of squares of the deviations of Y but that of the deviations of log Y. In practice this distinction is usually not of great importance, and in any event if we bear it in mind we should not be led into faulty conclusions.
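
As an illustration of the first transformation, the sketch below fits a straight line to log Y against x and converts the constants back to A and B. The data are hypothetical and free of error, generated from A = 2 and B = 1.5 simply so that the recovery of the constants can be seen; the function name `fit_line` is our own.

```python
import math


def fit_line(xs, ys):
    """Least-squares straight line y = a + b*x; returns (a, b).
    Deviations are measured in y, x being taken as free of error."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    b = sxy / sxx
    return y_bar - b * x_bar, b


# Hypothetical, noise-free data following Y = A * B**x with A = 2, B = 1.5.
xs = [0, 1, 2, 3, 4]
Ys = [2.0 * 1.5 ** x for x in xs]

# Fit log Y = log A + x log B, then transform the constants back.
a, b = fit_line(xs, [math.log10(Y) for Y in Ys])
A, B = 10 ** a, 10 ** b
print(A, B)
```

With real data carrying error, the fitted line minimises the squared deviations of log Y, as the text points out.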

(c)  The  Correlation  Coefficient 

To test for the significance of an apparently linear relation we calculate the correlation coefficient r defined by ($\sum$ denotes summation over all pairs of observations)

$$r = \frac{\sum (x - \bar{x})(y - \bar{y})}{\sqrt{\sum (x - \bar{x})^2 \sum (y - \bar{y})^2}}$$

It  has  the  characteristics  such  that  if  the  relationship  between  the  data  can 
be  represented  exactly  by  a  straight  line  then  r  =  ±  1,  positive  if  the  straight 
line  has  a  positive  slope  and  negative  if  the  line  has  a  negative  slope  ;  if  on  the 
other  hand  there  is  no  relation  at  all  between  the  variables  then  r  =  0.  Even 
when  the  variables  may  have  no  relationship  we  will  usually  get  a  non-zero  value 
for  r,  since  the  accidents  of  sampling  will  often  lead  to  an  apparent  trend  one  way 
or the other. We can, however, test whether an observed value for r is larger than would have been obtained accidentally in the absence of a correlation. We enter the table for r with degrees of freedom two less than the number of pairs of observations (the reason for it being two less is that we have in effect fitted two constants, a and b, to the data). If our value for r exceeds that given in the table for, say, the 5% level of significance, then we may regard it as being 19 to 1 that there really is a correlation.

We will illustrate the use of the statistical treatment of data suspected of being expressed in the form y = a + bx with the data partially given below (there are actually 30 pairs of observations).

y     1.66    2.00    1.66    1.78    1.89    1.82    ...    2.07    2.29
x        6       8      17      16      19      16    ...      13      12

In the table, y represents stick weight of cordite made from nitro-cellulose with acetone viscosities x. The variables have been transformed, however, to make the arithmetic lighter; y actually represents stick weight in ounces minus 13 lbs. 1 oz., and x actually viscosity in centipoises minus 50. On plotting y against x one obtains a graph with a wide scatter but, apparently, a decided tendency for higher viscosities to give lower stick weights.

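We obtain the following terms (n = number of pairs of observations).

$$\sum (x - \bar{x})^2 = \sum x^2 - \frac{(\sum x)^2}{n} = (6^2 + 8^2 + \ldots + 12^2) - \frac{(6 + 8 + \ldots + 12)^2}{30} = 3213$$

$$\sum (y - \bar{y})^2 = \sum y^2 - \frac{(\sum y)^2}{n} = (1.66^2 + 2.00^2 + \ldots + 2.29^2) - \frac{(1.66 + 2.00 + \ldots + 2.29)^2}{30} = 9.060$$

$$\sum (x - \bar{x})(y - \bar{y}) = \sum xy - \frac{(\sum x)(\sum y)}{n} = (1.66 \times 6 + 2.00 \times 8 + \ldots) - \frac{(1.66 + 2.00 + \ldots)(6 + 8 + \ldots)}{30} = -47.84$$

We then calculate the correlation coefficient r, defined as

$$r = \frac{\sum (x - \bar{x})(y - \bar{y})}{\sqrt{\sum (x - \bar{x})^2 \sum (y - \bar{y})^2}} = \frac{-47.84}{\sqrt{9.060 \times 3213}} = -0.280$$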


We  see  from  the  table  for  r  (Table  IV  in  the  Appendix)  that  the  probability 
of  getting  such  a  value  for  r  in  the  absence  of  any  correlation  is  about  0.10,  that 
is  to  say  1  in  10  times  we  could  get  as  large  a  value  for  r  as  we  did  here  even  in 
the  absence  of  a  correlation.     The  evidence  for  the  correlation  is  thus  inadequate. 
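
Since only part of the 30 pairs is printed, the sketch below works from the three sums quoted above rather than from the raw observations. The function names are our own. The conversion of r to a value of t, $t = r\sqrt{n-2}/\sqrt{1-r^2}$, is a standard identity and not something stated in the text; it is included as one way of referring r to a table of t when a table of r is not to hand.

```python
import math


def correlation_from_sums(sxy, sxx, syy):
    """Correlation coefficient from the sums of products and squares of deviations."""
    return sxy / math.sqrt(sxx * syy)


def t_from_r(r, n_pairs):
    """Equivalent value of t with n_pairs - 2 degrees of freedom."""
    df = n_pairs - 2
    return r * math.sqrt(df) / math.sqrt(1 - r * r)


# Sums quoted in the text for the cordite example (30 pairs of observations).
r = correlation_from_sums(sxy=-47.84, sxx=3213, syy=9.060)
t = t_from_r(r, 30)
print(round(r, 3), round(t, 2))
```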

(d)  The  Equation  for  the  Regression  Line 

If  r  proved  to  be  significantly  greater  than  zero  we  would  take  the  matter  a 
little  further :  although  in  the  example  considered  here  r  was  not  significantly 
greater  than  zero  we  will  use  its  data  to  illustrate  the  point. 

If we wished to use the data to predict values of y from known values of x we would calculate

$$a = \bar{y} = \frac{\sum y}{n} = \frac{49.58}{30} = 1.65$$

$$b = \frac{\sum (y - \bar{y})(x - \bar{x})}{\sum (x - \bar{x})^2} = \frac{-47.84}{3213} = -0.0149$$

$$\bar{x} = \frac{\sum x}{n} = \frac{670}{30} = 22.3$$

The formula for deducing the most probable value of y corresponding to a given value of x (technically known as the regression of y upon x) is then

$$y = a + b(x - \bar{x}) = 1.65 - 0.0149(x - 22.3) = 1.98 - 0.0149x$$

This regression line of y upon x is the line which makes a minimum the sum of the squares of the deviations of the points, measured parallel to the y-axis in units of y. It gives the best estimate of y from a known value of x.
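
The constants of the regression line can be reproduced from the quoted sums as follows. The names are our own. Carrying unrounded intermediate values gives an intercept of about 1.985, against the 1.98 obtained in the text from the rounded figures 1.65, 0.0149 and 22.3.

```python
n = 30
sum_y, sum_x = 49.58, 670   # totals quoted in the text
sxy, sxx = -47.84, 3213     # sums of products and of squares of deviations

a = sum_y / n               # mean of y
b = sxy / sxx               # regression coefficient of y upon x
x_bar = sum_x / n
intercept = a - b * x_bar   # constant term when the line is written y = c + b*x


def predict_y(x):
    """Estimate of y for a given x from the regression of y upon x."""
    return a + b * (x - x_bar)


print(round(a, 2), round(b, 4), round(x_bar, 1), round(intercept, 3))
print(predict_y(x_bar))     # the line passes through the point of means
```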

If we wished to predict the value of x from a known value of y we would use the similar but different equation derived by calculating

$$a' = \bar{x} = \frac{\sum x}{n}$$

$$b' = \frac{\sum (y - \bar{y})(x - \bar{x})}{\sum (y - \bar{y})^2}$$

$$\bar{y} = \frac{\sum y}{n}$$

leading to the equation

$$x = a' + b'(y - \bar{y})$$

This line makes a minimum the sum of the squares of the deviations of the points, measured parallel to the x-axis in units of x. It gives the best estimate of x from a known value of y.

Non-statisticians  frequently  object  to  the  concept  of  two  "best"  lines,  two 
regression  lines.  They  hold  that  there  must  be  one  line  which  represents  the 
"true"  relation.  That  may  be  so,  but  we  are  not  concerned  with  "true"  relations  : 
we  are  concerned  with  making  the  best  possible  estimate  of  y  from  a  known  value 
of  x,  using  a  certain  amount  of  experimental  data  to  estimate  the  relationship. 
In  one  case  we  assume  that  we  know  x  and  wish  to  estimate  y,  in  the  other  case 
we  assume  that  we  know  y  and  wish  to  estimate  x.  These  are  two  different 
operations,  so  it  is  not  surprising  that  we  use  two  different  functions. 
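
That the two regression lines are genuinely different can be seen on a small set of hypothetical figures. The five pairs below are invented for illustration only; the names are our own. The product of the two regression coefficients equals $r^2$, so the lines coincide only when the correlation is perfect.

```python
# Hypothetical data, for illustration only.
xs = [1, 2, 3, 4, 5]
ys = [2, 1, 4, 3, 5]

n = len(xs)
x_bar = sum(xs) / n
y_bar = sum(ys) / n
sxx = sum((x - x_bar) ** 2 for x in xs)
syy = sum((y - y_bar) ** 2 for y in ys)
sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))

b = sxy / sxx            # regression of y upon x:  y = y_bar + b  (x - x_bar)
b_prime = sxy / syy      # regression of x upon y:  x = x_bar + b' (y - y_bar)
r_squared = sxy ** 2 / (sxx * syy)

# Slope of the second line when drawn on the same graph of y against x.
slope_of_x_on_y_line = 1 / b_prime

print(b, slope_of_x_on_y_line, r_squared)
```

Here the line for estimating y has slope 0.8 while the line for estimating x, drawn on the same axes, has slope 1.25.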


(e)  The  Residual  Variance  about  the  Regression  Line 

We assume that we now have our regression line of y upon x with the points scattered about on either side of it. The standard deviation $\sigma_r$ of this scatter, measured in units of y parallel to the y-axis, is given by

$$\sigma_r = \sqrt{1 - r^2} \times \sqrt{\frac{\sum (y - \bar{y})^2}{n - 2}} = \sqrt{1 - 0.280^2} \times \sqrt{\frac{9.060}{28}} = 0.546$$

This  figure  affords  a  good  measure  of  the  usefulness  of  the  equation  for  the 
regression  line  for  predicting  values  of  y  for  given  values  of  x. 

Thus suppose we wish to draw 95% confidence limits on either side of the regression line, within which 95% of all points should lie. We look up the value of t for n - 2 = 28 degrees of freedom and the 5% level of significance, and calculate $t\,\sigma_r = 2.05 \times 0.546 = 1.12$. We then draw in two lines parallel to the regression line, one displaced 1.12 units of y downwards and one displaced 1.12 units of y upwards. Thus if we use the regression line to predict the value of y from a known value of x, these confidence limits give us the limits between which we have a 95% chance of being correct in our prediction.(3)
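
The residual standard deviation and the approximate limits can be sketched from the quoted sums. The value t = 2.05 is the one taken from the table in the text; the function and variable names are our own, and the limits are the approximate parallel ones described above, not the exact treatment.

```python
import math

n = 30
sxy, sxx, syy = -47.84, 3213, 9.060  # sums quoted in the text
a, x_bar = 49.58 / n, 670 / n
b = sxy / sxx

r = sxy / math.sqrt(sxx * syy)
sigma_r = math.sqrt(1 - r * r) * math.sqrt(syy / (n - 2))

T_5_PERCENT = 2.05                   # t for 28 degrees of freedom, from the text
half_width = T_5_PERCENT * sigma_r


def approximate_limits(x):
    """Lower and upper approximate 95% limits for y at a given x."""
    y_hat = a + b * (x - x_bar)
    return y_hat - half_width, y_hat + half_width


print(round(sigma_r, 3), round(half_width, 2))
print(approximate_limits(20))
```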

(f)  The  Use  of  the  Analysis  of  Variance  for  Examining  Regression 

An  alternative  method  of  testing  the  significance  of  a  regression  is  by  analysing 
the  variance  of  the  dependent  variable  y  into  its  component  factors. 

(3) This statement is approximate: the exact treatment follows in Section (h).

The total sum of squares of y is $\sum (y - \bar{y})^2 = 9.060$. The sum of squares accountable for by the regression line is

$$\frac{[\sum (y - \bar{y})(x - \bar{x})]^2}{\sum (x - \bar{x})^2} = \frac{47.84^2}{3213} = 0.7123$$

We then enter these into a table of analysis of variance as below:

Source of        Sums of      Degrees of     Mean
Variance         Squares      Freedom        Squares

Regression       0.7123        1             0.7123
Residual         8.3477       28             0.2981

Total            9.060        29

The  degrees  of  freedom  are  1  for  the  regression  line,  one  less  than  the  total  number 
of  pairs  of  observations  for  the  Total,  and  the  residual  is  the  difference  between 
these  two.  Similarly  the  residual  sum  of  squares  is  obtained  as  the  difference 
between  the  Total  and  the  Regression. 

The  significance  of  the  Regression  Line  is  estimated  by  comparing  its  mean 
square  with  the  residual  in  the  usual  way.  Here  it  lies  between  the  20%  and 
5%  levels. 

The residual is the same as the residual variance about the regression line that we calculated in the previous section; thus the square root of this residual variance is $\sqrt{0.2981} = 0.546$ as obtained earlier.

It  will  be  noted  that  the  results  obtained  by  the  present  approach  are  identical 
with  those  found  by  calculating  the  correlation  coefficient. 
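
The table can be reproduced from the quoted sums as below; the names are our own. The variance ratio obtained is the square of the value of t equivalent to r, which is one way of seeing why the two approaches must agree.

```python
import math

n = 30
sxy, sxx, syy = -47.84, 3213, 9.060  # sums quoted in the text

ss_regression = sxy ** 2 / sxx
ss_total = syy
ss_residual = ss_total - ss_regression

df_regression = 1
df_total = n - 1
df_residual = df_total - df_regression

ms_regression = ss_regression / df_regression
ms_residual = ss_residual / df_residual
variance_ratio = ms_regression / ms_residual

# The same quantity reached through the correlation coefficient.
r = sxy / math.sqrt(sxx * syy)
t_squared = r * r * (n - 2) / (1 - r * r)

print(round(ss_regression, 4), round(ms_residual, 4), round(variance_ratio, 2))
print(round(math.sqrt(ms_residual), 3))  # residual standard deviation
```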


(g)  Comparison  of  Regression  Coefficients 

We  might  wish  to  test  whether  two  regression  coefficients  were  significantly 
different.  Thus  we  might  be  conducting  experiments  upon  some  system,  (a) 
with  clockwise  stirring  and  (b)  with  anti -clockwise  stirring.  We  might  plot 
graphs  for  the  two  series  and  suspect  that  they  were  different. 

We can make this test by using the fact that the variance of a regression
coefficient is given by

$$\sigma_b^2 = \frac{\text{Residual Variance of } y}{\Sigma(x - \bar{x})^2}$$

i.e. in the present instance 0.2981/3213. The standard deviation is the square
root of this, namely 0.0096, with two less than the total number of observations,
namely 28, as its degrees of freedom.

For comparing two regression coefficients $b_1$ and $b_2$ which have residual
variances about the regression lines of $\sigma_1^2$ and $\sigma_2^2$ with degrees of freedom $n_1$ and
$n_2$ we form the weighted average residual variance

$$\sigma^2 = (n_1 \sigma_1^2 + n_2 \sigma_2^2)/(n_1 + n_2)$$

The variance of $b_1$ is then

$$\sigma^2 / \Sigma_1 (x - \bar{x})^2$$

where $\Sigma_1 (x - \bar{x})^2$ is the sum of the squares of the deviations of x for the first
regression coefficient, and similarly for $b_2$. The variance of their difference is then

$$\sigma_{12}^2 = \sigma^2 \left[ \frac{1}{\Sigma_1 (x - \bar{x})^2} + \frac{1}{\Sigma_2 (x - \bar{x})^2} \right]$$

We then calculate $t = (b_1 - b_2)/\sigma_{12}$, and the degrees of freedom for this t are
$(n_1 + n_2)$.
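
As a hedged illustration of the t test just described, the sketch below pools the two residual variances, forms the variance of the difference of the slopes, and returns t with its degrees of freedom. The function name `compare_slopes` is an assumption made for the example. The inputs are derived from the reaction velocity example given later in this section (slopes 1933.5/45 and 1651.5/45, residual sums of squares 3268.70 and 3206.70 on 2 degrees of freedom each).

```python
import math


def compare_slopes(b1, b2, var1, var2, n1, n2, sxx1, sxx2):
    """Return (t, degrees of freedom) for the difference of two slopes.

    var1, var2: residual variances about each line, with n1, n2 degrees of freedom.
    sxx1, sxx2: sums of squares of deviations of x for each line.
    """
    pooled_var = (n1 * var1 + n2 * var2) / (n1 + n2)
    var_of_difference = pooled_var * (1.0 / sxx1 + 1.0 / sxx2)
    t = (b1 - b2) / math.sqrt(var_of_difference)
    return t, n1 + n2


t, dof = compare_slopes(
    b1=1933.5 / 45, b2=1651.5 / 45,
    var1=3268.70 / 2, var2=3206.70 / 2,
    n1=2, n2=2,
    sxx1=45.0, sxx2=45.0,
)
print(f"t = {t:.3f} with {dof} degrees of freedom")
```

With one degree of freedom in the numerator, the square of this t equals the variance ratio for the difference between slopes that the analysis of variance treatment arrives at, so the two methods agree.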

An alternative method of making the comparison between two regression
coefficients, which at the same time allows the comparison of the intercepts on
the y-axis, is with the analysis of variance. The table below gives the results
of an experiment on the measurement of a reaction velocity (y). This was
measured at four different concentrations of a catalyst (x) and two different
concentrations of an emulsifier (z).

| Emulsifier Concentration | | | | | Total |
|---|---|---|---|---|---|
| Catalyst concentration (x) | 0 | 3 | 6 | 9 | 18 |
| $z_1$ | 100 | 230 | 301 | 506 | 1137 |
| $z_2$ | 177 | 324 | 360 | 532 | 1393 |

The analysis of variance of y, the reaction velocity, proceeds quite logically.
The total sum of squares for all observations, with 8 - 1 = 7 degrees of freedom, is

$$100^2 + 230^2 + \ldots + 532^2 - (1137 + 1393)^2/8 = 158353.50$$

Considering the line with the emulsifier concentration at $z_1$, the total sum of squares is

$$100^2 + 230^2 + 301^2 + 506^2 - 1137^2/4 = 86344.75$$

To determine the sum of squares accounted for by the best regression line for $z_1$,
we use the method of Section (f). We need

$$\Sigma(x - \bar{x})^2 = \Sigma x^2 - (\Sigma x)^2/N = 0^2 + 3^2 + 6^2 + 9^2 - 18^2/4 = 45.00$$

$$\Sigma(y - \bar{y})(x - \bar{x}) = \Sigma xy - \Sigma x \, \Sigma y/N = 0 \times 100 + 3 \times 230 + 6 \times 301 + 9 \times 506 - 18 \times 1137/4 = 1933.50$$

and the sum of squares accounted for by the regression line is then $1933.50^2/45 =
83076.05$. The residual sum of squares about this regression line is (86344.75 -
83076.05) = 3268.70.

A similar set of operations for the best regression line for $z_2$ gives us as the
sum of squares accounted for by this regression line 60610.05 and the residual
sum of squares about it 3206.70.

Accordingly we can enter in the table of analysis of variance (Table 9.1) the
residual sum of squares, (3268.70 + 3206.70) = 6475.40. Each separate regression
line, based on 4 observations, will have 2 degrees of freedom for its residual,
so together we have 4.
We now need to consider what happens when we ignore the distinction between
$z_1$ and $z_2$. We have

$$\Sigma(x - \bar{x})^2 = 0^2 + 0^2 + 3^2 + 3^2 + \ldots - (2 \times 18)^2/8 = 90$$

$$\Sigma(x - \bar{x})(y - \bar{y}) = 0 \times 100 + 0 \times 177 + \ldots - (2 \times 18)(1137 + 1393)/8 = 3585.00$$

The sum of squares accounted for by the best single line, with 1 degree of freedom,
is then $3585.00^2/90 = 142802.50$, and this is entered in the table.

The improvement due to the use of the two separate lines in place of the
single line is the amount by which the sum of the two sums of squares
for the separate regressions exceeds this:

$$(83076.05 + 60610.05) - 142802.50 = 883.60$$

This  has  1  degree  of  freedom. 

There  is  1  degree  of  freedom  not  yet  accounted  for  ;  this  and  its  sum  of 
squares,  found  by  difference,  are  attributable  to  the  difference  in  intercepts. 

Table  9.1 


| Source of Variance | Degrees of Freedom | Sums of Squares | Mean Squares |
|---|---|---|---|
| Common Regression Line | 1 | 142802.50 | 142802.50 |
| Difference between Common Line and Separate Lines | 1 | 883.60 | 883.60 |
| Difference in Intercepts | 1 | 8192.00 | 8192.00 |
| Residual | 4 | 6475.40 | 1618.85 |
| Total | 7 | 158353.50 | |

Using  the  variance  ratio  test,  it  is  obvious  that  there  is  no  evidence  for  a 
difference  between  the  two  regression  coefficients.  For  the  difference  between 
intercepts,  the  variance  ratio  of  5.06  does  not  reach  the  5%  level  of  significance. 
The  difference  between  the  intercepts  is  therefore  not  established,  and  in  the 
absence  of  further  data  we  could  regard  the  best  joint  regression  line  as  an  adequate 
fit.  On  the  other  hand,  if  we  had  reason  to  suppose  that  the  lines  would  be  different 
there  is  nothing  to  stop  us  using  the  separate  lines. 
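
The entries of Table 9.1 can be checked from the raw data with a few lines of Python. This is a sketch of the arithmetic only; the helper `corrected_sums` and the variable names are illustrative assumptions.

```python
def corrected_sums(xs, ys):
    """Sums of squares and products taken about the means."""
    n = len(xs)
    sxx = sum(x * x for x in xs) - sum(xs) ** 2 / n
    sxy = sum(x * y for x, y in zip(xs, ys)) - sum(xs) * sum(ys) / n
    syy = sum(y * y for y in ys) - sum(ys) ** 2 / n
    return sxx, sxy, syy


x = [0, 3, 6, 9]
y_z1 = [100, 230, 301, 506]
y_z2 = [177, 324, 360, 532]

sxx1, sxy1, syy1 = corrected_sums(x, y_z1)
sxx2, sxy2, syy2 = corrected_sums(x, y_z2)
sxx, sxy, syy = corrected_sums(x + x, y_z1 + y_z2)   # both lines pooled

common_ss = sxy ** 2 / sxx                            # best single line
separate_ss = sxy1 ** 2 / sxx1 + sxy2 ** 2 / sxx2     # two separate lines
slopes_ss = separate_ss - common_ss                   # gain from separate slopes
residual_ss = (syy1 - sxy1 ** 2 / sxx1) + (syy2 - sxy2 ** 2 / sxx2)
intercepts_ss = syy - common_ss - slopes_ss - residual_ss   # by difference
residual_ms = residual_ss / 4

print(common_ss, slopes_ss, intercepts_ss, residual_ss, residual_ms)
print("variance ratio, slopes:", slopes_ss / residual_ms)
print("variance ratio, intercepts:", intercepts_ss / residual_ms)
```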


(h)  Exact  Formula  for  the  Residual  Variance  about  the  Regression 
Line 

As stated earlier, the formula given in Section (e) for the residual variance
about the regression line is not quite exact. If $\sigma_r^2$ is the residual variance as
already defined, and $\sigma_b^2$ is the variance of the regression coefficient itself, calculated
as in Section (g), then the exact formula (4) for $\sigma_{r1}^2$ is

$$\sigma_{r1}^2 = (\sigma_r^2 + X^2 \sigma_b^2)$$

where X is the distance in units of x from the midpoint of x, $\bar{x}$. For example,
in the case we have been considering, $\bar{x}$ is quoted in section (d) as 22.3. In
section (e) $\Sigma(x - \bar{x})^2$ is quoted as 3213, whence $\sigma_x^2 = 110.79$. We have $\sigma_x$ of
the order of 10, and thus to select a point towards one end of the x scale we can
take $\bar{x} + 2\sigma_x = 22.3 + 2 \times 10 = 42.3$: this corresponds to X = 20. Inserting
the appropriate values for $\sigma_r$ and $\sigma_b$ we get

$$\sigma_{r1}^2 = (0.2981 + 20^2 \times 0.00009278) = 0.3352$$

whence $\sigma_{r1} = 0.5790$.

(4) H. Schultz, Journal of the American Statistical Association, Vol. XXV, page 139,
1930, gives a full treatment of this and the more complicated cases of parabolas, etc.

In  the  present  instance,  it  is  clear  that  the  use  of  the  approximate  formula 
would  have  given  results  close  to  the  truth,  and  of  course  the  nearer  we  are  to 
the  midpoint  of  x  the  smaller  is  X  and  the  less  important  the  correction. 
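
A small Python sketch makes the size of the correction easy to see at any distance X from the midpoint. The function name is an illustrative assumption; the numbers are those quoted above (residual variance 0.2981, sum of squares of x 3213).

```python
import math


def exact_residual_sd(residual_var, slope_var, distance):
    """Standard deviation about the line at `distance` units of x from the mean of x."""
    return math.sqrt(residual_var + distance ** 2 * slope_var)


residual_var = 0.2981
slope_var = residual_var / 3213          # variance of the regression coefficient

sd_at_20 = exact_residual_sd(residual_var, slope_var, 20.0)
sd_at_midpoint = exact_residual_sd(residual_var, slope_var, 0.0)
print(f"X = 20: {sd_at_20:.4f}   X = 0: {sd_at_midpoint:.4f}")
```

At X = 0 the result reduces to the approximate value, and the correction grows with the square of the distance from the midpoint.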

(i)  The  Use  of  the  Analysis  of  Variance  for  Checking  Linearity 

A  problem  arising  frequently  is  to  determine  whether  a  set  of  observations 
which  to  the  first  approximation  can  be  represented  by  a  straight  line  can  be 
said  to  depart  significantly  from  the  straight  line.  This  question  can  be  readily 
settled  if  the  data  is  in  the  symmetrical  form  suitable  for  the  application  of  the 
analysis  of  variance. 

Consider  the  data  below : — 


| x | 10 | 20 | 30 | 40 |
|---|---|---|---|---|
| y | 92.8 | 94.0 | 95.1 | 94.9 |
| y | 93.0 | 94.3 | 94.8 | 94.8 |
| $\bar{y}$ | 92.9 | 94.15 | 94.95 | 94.85 |

We can regard x as representing the temperature in °C of a certain part of
a process and the two values of y as repeat determinations of the yield at that
temperature. We are interested in the effect of temperature upon the yield. To the
first approximation the yield y appears to increase linearly with the temperature
x, but the value for x = 40 is less than that for x = 30. We shall test whether
this apparent departure from linearity is significant.

We  first  transform  the  variables  to  give  us  smaller,  and  hence  easier,  numbers 
to  work  with.  Thus  we  can  use  X  =  x/10,  and  Y  =  (y  —  92)  x  10.  The  data 
then  becomes  : 


| X | 1 | 2 | 3 | 4 |
|---|---|---|---|---|
| Y | 8 | 20 | 31 | 29 |
| Y | 10 | 23 | 28 | 28 |

We  can  in  the  first  instance  analyse  the  variance  of  the  data  into  between  columns 
(given  values  of  X)  and  within  columns. 

Proceeding on these lines, we calculate the terms

$$(1) = 8^2 + 10^2 + \ldots + 28^2 = 4463$$

$$(2) = [(8 + 10)^2 + (20 + 23)^2 + (31 + 28)^2 + (29 + 28)^2]/2 = 4451.5$$

$$(3) = (8 + 10 + 20 + \ldots + 28)^2/8 = 3916.1$$
We enter these terms into a table as below:

| Source of Variance | Sums of Squares | Degrees of Freedom | Mean Squares |
|---|---|---|---|
| Between Temperatures | (2) - (3) = 535.4 | 3 | 178.5 |
| Within Temperatures | (1) - (2) = 11.5 | 4 | 2.87 |
| Total | (1) - (3) = 546.9 | 7 | |

Testing the Between Temperature Mean Square against the Within
Temperature Mean Square, we see that it is significant.

We  now  divide  the  Between  Temperature  Sum  of  Squares  into  that  due  to 
the  linear  regression  line  and  that  due  to  departure  from  it. 

We calculate

$$\Sigma(X) = 2(1 + 2 + 3 + 4) = 2 \times 10 = 20$$

$$\Sigma(X^2) = 2(1^2 + 2^2 + 3^2 + 4^2) = 2 \times 30 = 60$$

$$\Sigma(X - \bar{X})^2 = 60 - 20^2/(4 \times 2) = 10$$

$$\Sigma(XY) = 1(8 + 10) + 2(20 + 23) + 3(31 + 28) + 4(29 + 28) = 509$$

$$\Sigma(Y - \bar{Y})(X - \bar{X}) = \Sigma(XY) - \Sigma(X)\,\Sigma(Y)/N = 509 - 20 \times 177/8 = 66.5$$

The factor 2 in the expressions for $\Sigma(X)$ and $\Sigma(X^2)$ arises through there being
2 observations on Y for each level of X.


The sum of squares attributable to the regression line is then

$$\frac{[\Sigma(Y - \bar{Y})(X - \bar{X})]^2}{\Sigma(X - \bar{X})^2} = \frac{66.5^2}{10} = 442.2$$


Since  the  total  sum  of  squares  attributable  to  differences  between  temperatures 
is  535.4,  the  sum  of  squares  due  to  deviations  from  the  regression  line  is 

535.4  —  442.2  =  93.2 
The  degrees  of  freedom  for  the  regression  line  are  1  and  for  the  deviations  from 
it  the  remainder  of  those  belonging  to  differences  between  temperatures. 

We  enter  up  these  expressions  in  a  table  as  below : 


| Source of Variance | Sums of Squares | Degrees of Freedom | Mean Squares |
|---|---|---|---|
| Between Temperatures: Regression | 442.2 | 1 | 442.2 |
| Between Temperatures: Deviation from Regression | 93.2 | 2 | 46.6 |
| Within Temperatures (Residual) | 11.5 | 4 | 2.87 |
| Total | 546.9 | 7 | |
The general significance of the regression line can be tested by comparing the
Regression Mean Square with the Residual. Here it is clearly highly significant.
To test whether the departure from the straight line is significant, we calculate
the variance ratio for the Deviation from Regression as 46.6/2.87 = 16.2 for
degrees of freedom $n_1 = 2$, $n_2 = 4$. This is nearly as significant as the 1% level,
so we may take it with that degree of confidence that the data cannot be
completely represented by a straight line.
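
The whole linearity check can be retraced in a few lines of Python using the transformed data (X = 1 to 4, two values of Y at each level). This is a sketch of the arithmetic under the layout used above; the variable names are illustrative.

```python
X = [1, 2, 3, 4]
Y = [[8, 10], [20, 23], [31, 28], [29, 28]]   # repeat determinations per level of X

n_obs = sum(len(col) for col in Y)
grand_total = sum(sum(col) for col in Y)

term1 = sum(v * v for col in Y for v in col)            # term (1)
term2 = sum(sum(col) ** 2 / len(col) for col in Y)      # term (2)
term3 = grand_total ** 2 / n_obs                        # term (3)

between_ss = term2 - term3
within_ss = term1 - term2

# Sums over all observations; each X level counts once per observation at it.
sum_x = sum(x * len(col) for x, col in zip(X, Y))
sxx = sum(x * x * len(col) for x, col in zip(X, Y)) - sum_x ** 2 / n_obs
sxy = sum(x * sum(col) for x, col in zip(X, Y)) - sum_x * grand_total / n_obs

regression_ss = sxy ** 2 / sxx
deviation_ss = between_ss - regression_ss
within_ms = within_ss / (n_obs - len(X))          # 4 degrees of freedom
deviation_ms = deviation_ss / (len(X) - 2)        # 2 degrees of freedom
variance_ratio = deviation_ms / within_ms

print(regression_ss, deviation_ss, within_ss, variance_ratio)
```

The variance ratio is then referred to the tables with 2 and 4 degrees of freedom, as in the text.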


The best estimate of the regression coefficient of Y upon X is given by

$$b = \frac{\Sigma(Y - \bar{Y})(X - \bar{X})}{\Sigma(X - \bar{X})^2} = \frac{66.5}{10} = 6.65$$


i^-m-ifl_tts 


(j)  The  Calculation  of  Correlation  Coefficient,  etc.,  from  Grouped 
Data 

If the number of pairs of observations is large, say about 100, the labour in
calculating the correlation coefficient as in Section X (c) becomes considerable,
and can be lightened by grouping the data on the same general lines as in Chapter
II (j).

We  divide  each  axis  into  about  10  or  20  equal-sized  groups,  and  count  up 
the  number  of  observations  occurring  in  each  cell. 

The data in Table 9.2 refers to a manufacturing process involving the
formation of an emulsion in water. The independent variable x is a function of the
quality of the water (hardness) and the dependent variable y is a quality of the
final product. Table 9.2 is a graph with $A_x$ representing the x-axis and $A_y$ the
y-axis. For each set of values of $A_x$ and $A_y$, i.e. for each point on the graph,
the number gives the number of observations occurring with those values (when
none occur then the space is left blank). The column headed B gives the frequency
with which the corresponding values have occurred. Thus to take $A_x = 10$, it
will be seen that there are points with $A_x = 10$ at $A_y$ = 23, 17, and 16, so the
corresponding entry for $B_x$ is 3. C is the arbitrary scale with shifted origin used
to lighten the arithmetic. D and E are the terms $B \times C$ and $B \times C^2$ respectively.
The total number of observations is given by the sum of the frequencies in Column
B, here 101.

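Thus we have

$$\Sigma(x - \bar{x})^2 = \Sigma(x^2) - \frac{(\Sigma x)^2}{N} = 915 - \frac{(-25)^2}{101} = 908.8$$

$$\Sigma(y - \bar{y})^2 = \Sigma(y^2) - \frac{(\Sigma y)^2}{N} = 875 - \frac{43^2}{101} = 856.7$$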

The calculation of the term $\Sigma(x - \bar{x})(y - \bar{y})$ is rather more difficult. For
each level of $C_y$, we multiply the frequency for each cell by its value for $C_x$, and
sum for all cells at this level of $C_y$. This is the column headed F. Thus for
$C_y = -3$, we have three cells, the first with unit frequency at $C_x = -3$, the
second with frequency 2 at $C_x = -2$, the third with unit frequency at $C_x = 1$.
Thus $F = 1 \times (-3) + 2 \times (-2) + 1 \times 1 = -6$. As a check upon the
accuracy of column F its sum should equal $\Sigma x$. Column G is then obtained as the
product of $C_y$ and F. Its sum is $\Sigma(xy)$. To correct this into terms of deviations
from the means rather than the arbitrary zeros we subtract the correcting factor

$$\frac{\Sigma x \, \Sigma y}{N} = \frac{(-25) \times (+43)}{101} = -10.64$$

We thus obtain

$$\Sigma(x - \bar{x})(y - \bar{y}) = 399.0 - (-10.6) = 409.6$$
The correlation coefficient is then

$$R = \frac{\Sigma(x - \bar{x})(y - \bar{y})}{\sqrt{\Sigma(x - \bar{x})^2 \, \Sigma(y - \bar{y})^2}} = \frac{409.6}{\sqrt{908.8 \times 856.7}} = 0.464$$

which for 99 degrees of freedom is much more significant than the 0.1% level
of significance. The regression coefficient of y upon x can be calculated as before
as

$$b = \frac{\Sigma(x - \bar{x})(y - \bar{y})}{\Sigma(x - \bar{x})^2} = \frac{409.6}{908.8} = 0.451$$

and the regression equation then is

$$y = \bar{y} + b(x - \bar{x}) = 0.451x + 0.537$$
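
Once the column totals of the grouped table are in hand, the remaining arithmetic is short. The Python sketch below starts from the totals quoted in the text (N = 101, sum of x = -25, sum of y = +43, sums of squares 915 and 875, sum of products 399.0), all in the arbitrary C units; the variable names are illustrative assumptions.

```python
import math

n = 101
sum_x, sum_y = -25, 43
sum_x2, sum_y2, sum_xy = 915, 875, 399.0

# Correct from the arbitrary zeros to deviations from the means.
sxx = sum_x2 - sum_x ** 2 / n
syy = sum_y2 - sum_y ** 2 / n
sxy = sum_xy - sum_x * sum_y / n

r = sxy / math.sqrt(sxx * syy)                    # correlation coefficient
b = sxy / sxx                                     # regression coefficient of y upon x
intercept = sum_y / n - b * sum_x / n             # mean of y less b times mean of x

print(f"r = {r:.3f}, b = {b:.3f}, intercept = {intercept:.3f}")
```

The slope and intercept are in the arbitrary grouped units and would need converting back to the original scales before use.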

(k)  Correlation  and  Causation 

A significant correlation coefficient can be taken as an indication of
association between two variables, but it is important to realize that this does not
automatically imply causation.

Thus for example suppose we have plant data stretching over a period of
some months, and we find that the temperature $T_1$ at a certain stage varies quite
considerably. We may plot $T_1$ against the dependent variable y, obtain what
seems a linear relation, calculate the correlation coefficient and find it significant.
At this stage it would be dangerous to assume that y is necessarily a function of
$T_1$, for it may be that the real operating variable is the temperature $T_2$ in some
other part of the process, and that it happens that $T_1$ has been related to $T_2$
through some common operating factor such as weather.

This type of error is always liable to occur when our data is existing plant
records which we observe. It is less likely to occur if the data is obtained by
actual experiment. Thus in an experiment if we raise the temperature $T_1$ to a
chosen value, and the running of the plant as measured by the dependent variable
y (which may be yield, quality, etc.) improves, then for practical purposes it may
be sufficient to say "run the plant with $T_1$ at this value." However, this still
may not be real evidence of the causative effect of $T_1$ upon y, for the plant may
be such that in raising $T_1$ to the new value we have at the same time affected
the real causative variable $T_2$. Thus, for example, in a counter-current process,
raising the temperature at one point will probably affect the temperature at other
points.

The  considerations  outlined  in  the  last  three  paragraphs  do  not  in  any  way 
invalidate  the  desirability  of  the  testing  for  significance  of  any  apparent  relation. 
They  merely  imply  that  having  shown  a  correlation  to  be  significant,  caution  is 
necessary  in  assuming  that  this  association  is  evidence  of  a  causative  effect  of  one 
variable  upon  the  other. 
(l) Conclusions

Where  it  is  suspected  that  the  relation  between  two  variables  is  approximately 
linear,  there  are  three  statistics  which  between  them  summarise  the  most  important 
properties  of  the  available  data  : — 

(a)  the  correlation  coefficient,  with  the  associated  number  of  degrees  of 
freedom,  which  is  a  convenient  measure  of  the  degree  of  reliability  of 
the  association. 

(b)  the  regression  coefficient  b  that  measures  the  slope  of  the  regression 
line,  i.e.  the  average  increase  in  y  per  unit  increase  in  x,  and 

(c) the residual variance $\sigma_r^2$ that measures the scatter of values of y about
the regression line, i.e. the reliability of estimates of y for given values
of x when estimated from the regression line.

(d)  If  the  experiment  has  been  executed  with  appropriate  replication,  it  is 
often  desirable  for  the  analysis  of  variance  giving  the  significance  of 
departure  from  linearity  to  be  applied. 

In using a regression equation it must always be remembered that it is only
valid over the range of the independent variable which occurred in the data used
in calculating it. Extrapolation is most unwise, except when there is a very
sound theoretical basis. It is good practice to quote the range of the
independent variable used, in order to discourage extrapolation. (6)


CHAPTER  X 
MULTIPLE  CORRELATION 

(a)  Introduction 

We discussed in Chapter IX the determination of the relationship between
a dependent and an independent variable. In many of the cases which arise in
practice,  however,  we  find  that  there  is  more  than  one  independent  variable  which 
may  be  of  practical  importance.  In  these  circumstances  it  becomes  necessary  to 
isolate  the  effect  of  each  independent  variable  from  the  effects  of  the  other 
independent  variables. 

In  the  example  we  shall  consider  the  stack  loss  from  a  Pressure  Oxidation 
Plant  for  making  nitric  acid  (about  55% — 60%)  by  the  oxidation  of  ammonia 
with  air.  The  resulting  nitric  gases  are  absorbed  by  being  passed  upwards  through 
a  bubble  cap  absorption  column,  nitric  acid  being  circulated  down  the  column. 
The  absorption  is  exothermic,  i.e.  gives  rise  to  the  liberation  of  heat,  and  the 
temperature  tends  to  rise.  The  absorption  of  the  nitric  gases  is  less  efficient  at 
higher  temperatures,  so  to  keep  the  temperature  down  water  is  circulated  through 
cooling  coils  in  the  column.  The  column  also  tends  to  cool  itself  by  radiation, 
etc.,  to  its  surroundings.  A  small  quantity  of  the  nitric  gases  fails  to  be  absorbed 
and escapes by the stack: this we will regard as the dependent variable. Its
minimisation is, of course, of prime economic importance and we will first
determine its dependence upon the two independent variables air temperature, $x_a$,
and cooling water temperature, $x_w$. In a subsequent section we will consider a
third  independent  variable,  strength  of  the  nitric  acid  being  circulated  in  the 
absorption  column. 

(b)  Two  Independent  Variables 

We will determine the dependence of the P.O.P. stack loss (to be denoted
by y) upon the cooling water temperature, $x_w$, and the air temperature, $x_a$.

(6)  A  full  discussion  of  various  aspects  of  correlation  is  to  be  found  in  Chapters  11  to  16 
of  Yule  and  Kendall's  "An  Introduction  to  the  Theory  of  Statistics"  (Charles  Griffin). 
Accordingly data was collected giving corresponding values for these three
variables for 139 days.

It was assumed that the stack loss was to a first approximation related linearly
to the two independent variables, so that it could be represented by an equation
of the form

$$y = a_{y.wa} + b_{yw.a}(x_w - \bar{x}_w) + b_{ya.w}(x_a - \bar{x}_a)$$

The meaning of the symbols is as follows:

$a_{y.wa}$ is the value y would have if $x_a$ and $x_w$ were held constant at their mean
values.

$b_{yw.a}$ measures the rate of increase of y with increasing $x_w$, $x_a$ being constant.

$b_{ya.w}$ measures the rate of increase of y with increasing $x_a$, $x_w$ being constant.

If we had considered it more appropriate, we could, of course, have used an
equation of the type

$$y = a_{y.wa} + b_{yw.a}(x_w^2 - \bar{x}_w^2) + b_{ya.w}(x_a^2 - \bar{x}_a^2)$$

which  would  be  based  on  the  assumption  that  the  stack  loss  y  increased  as  the 
square  of  the  temperature,  or  in  the  first  of  the  above  equations  we  could  have 
substituted  for  the  temperature  the  reciprocal  of  the  absolute  temperatures,  or 
whatever  functions  we  considered  most  likely  to  represent  the  true  behaviour  of 
the  system. 

Taking  the  straightforward  linear*  equation  we  will  determine  the  values  of 
these  three  coefficients  which  give  the  best  general  fit  to  the  data.  We  proceed 
as  follows. 

It  makes  the  computation  lighter,  particularly  if  we  are  not  using  a  calculating 
machine,  to  use  arbitrary  zeros  and  to  multiply  by  10  or  100,  etc.,  if  necessary, 
to  remove  the  decimal  points.  Thus  in  the  present  instance  the  variables  used 
are  actually : — 

y    =  10  (stack  loss  in  gm/M3  —  3.0). 

xw  =  water  temperature  in  °C  —  20°C. 

xa  =  air  temperature  in  °F  —  50°F. 

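For each set of observations on the three variables, y, $x_w$, and $x_a$, we form the
three pairs of products, $y x_w$, $x_w x_a$, and $x_a y$. Thus we have a table of 139 sets
as below:

| Date | y | $x_w$ | $x_a$ | $x_w x_a$ | $x_a y$ | $x_w y$ |
|---|---|---|---|---|---|---|
| 30.12.43 | -20 | -6 | -11 | 66 | 220 | 120 |
| 29 | -9 | 1 | -3 | -3 | 27 | -9 |
| 28 | -18 | 1 | -2 | -2 | 36 | -18 |
| 27 | -6 | -1 | -1 | 1 | 6 | 6 |
| 25 | -3 | 2 | -3 | -6 | 9 | -6 |
| 22 | -5 | -5 | -10 | 50 | 50 | 25 |
| 22 | -20 | 2 | -20 | -40 | 400 | -40 |
| 21 | +19 | 5 | +14 | 70 | 266 | +95 |
| 18 | -4 | -1 | -14 | 14 | 56 | 4 |
| Sum (all 139 sets) | 1479 | 406 | 1 | 3533 | 17340 | 12797 |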

We require the sums of the six columns as shown in the last row of the table.
We will denote a summation by the letter $\Sigma$. Thus, e.g. $\Sigma(x_w x_a)$ is the sum of
the 139 $x_w x_a$'s, here 3533. We also require the three sums of the squares of the
individual observations on the three variables. We thus have:

$\Sigma y = 1,479$ ; $\Sigma y^2 = 99,077$

$\Sigma x_w = 406$ ; $\Sigma x_w^2 = 3,456$

$\Sigma x_a = 1$ ; $\Sigma x_a^2 = 10,525$

$\Sigma y x_w = 12,797$ ; $\Sigma y x_a = 17,340$ ; $\Sigma x_w x_a = 3,533$

So far these sums of squares and sums of products refer to deviations from
the arbitrary zeros. We need to convert these into deviations from the means
with the usual type of correcting factors. Thus, e.g.:

$$\Sigma(y - \bar{y})^2 = \Sigma(y^2) - \frac{(\Sigma y)^2}{N} = 99077 - \frac{1479^2}{139} = 83340.0$$

and

$$\Sigma(y - \bar{y})(x_w - \bar{x}_w) = \Sigma(y x_w) - \frac{(\Sigma y)(\Sigma x_w)}{N} = 12797 - \frac{1479 \times 406}{139} = 8477.0$$

In a similar manner we obtain the other terms as deviations from the means.
In order to have our symbolism more compact we will denote a summation of
deviations from the means by $\Sigma'$. Thus
$\Sigma(y - \bar{y})(x_w - \bar{x}_w) = \Sigma' y x_w$
and $\Sigma(y - \bar{y})^2 = \Sigma' y^2$.
We thus have

$\Sigma' y^2 = 83,340.0$ ; $\Sigma' x_w^2 = 2270.1$ ; $\Sigma' x_a^2 = 10,525.0$

$\Sigma' y x_w = 8,477.0$ ; $\Sigma' y x_a = 17,329.4$ ; $\Sigma' x_w x_a = 3,530.1$
To obtain the two regression coefficients $b_{yw.a}$ and $b_{ya.w}$ we have to solve
the two simultaneous equations

$$\Sigma' y x_w = b_{yw.a} \, \Sigma' x_w^2 + b_{ya.w} \, \Sigma' x_w x_a$$

$$\Sigma' y x_a = b_{yw.a} \, \Sigma' x_w x_a + b_{ya.w} \, \Sigma' x_a^2$$

Substituting our values, we obtain

$$8477 = b_{yw.a} \cdot 2270 + b_{ya.w} \cdot 3530 \qquad \text{(A)}$$

$$17329 = b_{yw.a} \cdot 3530 + b_{ya.w} \cdot 10525 \qquad \text{(B)}$$

To solve these two simultaneous equations we can, e.g., multiply (A) by the
ratio 10525/3530 = 2.9816 and obtain

$$25275 = b_{yw.a} \cdot 6768 + b_{ya.w} \cdot 10525 \qquad \text{(C)}$$

and subtracting (B) we obtain

$$b_{yw.a} = 7946/3238 = 2.4540$$

To obtain the other regression coefficient we substitute this value for $b_{yw.a}$ in,
say, (A) and find

$$8477 = 5570.6 + b_{ya.w} \cdot 3530$$

whence

$$b_{ya.w} = 0.8234$$

Our equation for estimating y is thus

$$(y - \bar{y}) = 2.4540 (x_w - \bar{x}_w) + 0.8234 (x_a - \bar{x}_a)$$

$$y - 10.6403 = 2.4540 (x_w - 2.9209) + 0.8234 (x_a - 0.0072)$$

or $y = 3.4665 + 2.4540 x_w + 0.8234 x_a$.
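
For two independent variables the pair of simultaneous equations can also be solved directly by determinants. The Python sketch below does this with the unrounded sums of squares and products, so its answers differ from the hand calculation in the fourth significant figure. The function name `solve_two_variable` is an assumption for the example, not the procedure followed in the text.

```python
def solve_two_variable(s_ww, s_wa, s_aa, s_yw, s_ya):
    """Solve the two normal equations for the partial regression coefficients."""
    det = s_ww * s_aa - s_wa ** 2
    b_w = (s_yw * s_aa - s_ya * s_wa) / det
    b_a = (s_ya * s_ww - s_yw * s_wa) / det
    return b_w, b_a


# Sums of squares and products of deviations from the means (from the text).
b_w, b_a = solve_two_variable(
    s_ww=2270.1, s_wa=3530.1, s_aa=10525.0, s_yw=8477.0, s_ya=17329.4
)

n = 139
mean_y, mean_w, mean_a = 1479 / n, 406 / n, 1 / n
intercept = mean_y - b_w * mean_w - b_a * mean_a

print(f"y = {intercept:.3f} + {b_w:.3f} x_w + {b_a:.3f} x_a")
```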
Before proceeding, it is as well to test the significance of the two regression
coefficients $b_{yw.a}$ and $b_{ya.w}$. This we can do with an application of the analysis of
variance.

We first want the total Sum of Squares of the dependent variable y. This
is obviously $\Sigma' y^2 = 83,340$.
We then calculate the Sums of Squares of y explained by the two independent
variables independently of each other. These are

$$\frac{(\Sigma' y x_w)^2}{\Sigma' x_w^2} = \frac{8477^2}{2270} = 31656.2$$

and

$$\frac{(\Sigma' y x_a)^2}{\Sigma' x_a^2} = \frac{17329^2}{10525} = 28531.5$$

for $x_w$ and $x_a$ respectively. We enter the larger of the two in the first line of the
table below.

We next calculate the sum of squares of y explained by the two independent
variables together. This sum of squares is

$$b_{yw.a} \, \Sigma' y x_w + b_{ya.w} \, \Sigma' y x_a = 2.4540 \times 8477 + 0.8234 \times 17329 = 35071$$
Then  the  sum  of  squares  explained  by  the  addition  of  the  second  independent 
variable  xa  is  the  difference  between  the  sum  of  squares  explained  by  xw  and 
xa  together  and  the  sum  of  squares  explained  by  xw  alone,  i.e. 

35,071.0  —  31,656.2  =  3414.8 
We  finally  enter  up  these  various  sums  of  squares  in  a  table  of  analysis  of  variance 
as  below. 

Table  10.1 


| Source of Variance | Sums of Squares | Degrees of Freedom | Mean Squares |
|---|---|---|---|
| Variation explained by $x_w$ | 31,656 | 1 | 31,656 |
| Increment explained by the addition of $x_a$ | 3,415 | 1 | 3,415 |
| Total explained by $x_w$ and $x_a$ together | 35,071 | | |
| Residual | 48,269 | 136 | 354.9 |
| Total | 83,340 | 138 | |

The  degrees  of  freedom  are  such  that  each  regression  coefficient  takes  1  and 
the  residual  takes  what  is  left.  Similarly  the  Residual  sum  of  squares  is  what  is 
left  out  of  the  Total  after  the  total  explained  by  the  two  regression  coefficients 
has  been  subtracted. 

To test for the significance of the second (smaller) regression term we
compare its mean square with the residual. Here the variance ratio is 3415/354.9
= 9.6 with degrees of freedom $n_1 = 1$, $n_2 = 136$. This is more significant than
the 1% level of significance, so there can be little doubt about the reality of the
effect of $x_a$.

Accordingly,  to  estimate  the  value  of  y  for  any  particular  values  of  xw  and 
xa  we  use  the  equation 

y  =  3.466  +  2.454  xw  +  0.823  xa 
and  the  variance  of  any  estimate  is  given  by  the  Residual  variance  in  the  above 
table  of  analysis  of  variance,  namely  354.9.     The  standard  deviation  will  be  the 
square  root  of  this,  namely  18.8. 
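
The entries of Table 10.1 follow from the sums of squares and products and the two fitted coefficients. The Python sketch below retraces them; because it keeps the unrounded sums (2270.1 and 17329.4 rather than 2270 and 17329) its figures differ slightly from the table in the last digit or two. The variable names are illustrative assumptions.

```python
import math

n = 139
s_yy, s_ww = 83340.0, 2270.1
s_yw, s_ya = 8477.0, 17329.4
b_w, b_a = 2.4540, 0.8234                  # coefficients found in the text

ss_w_alone = s_yw ** 2 / s_ww              # explained by x_w on its own
ss_both = b_w * s_yw + b_a * s_ya          # explained by x_w and x_a together
ss_increment = ss_both - ss_w_alone        # gain from adding x_a
residual_ss = s_yy - ss_both
residual_ms = residual_ss / (n - 3)        # 136 degrees of freedom
variance_ratio = ss_increment / residual_ms
residual_sd = math.sqrt(residual_ms)

print(f"increment {ss_increment:.0f}, residual mean square {residual_ms:.1f}")
print(f"variance ratio {variance_ratio:.1f}, standard deviation {residual_sd:.1f}")
```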

A modification, due to Fisher, of the above procedure is often useful. For
generality we will use $x_1$ for $x_w$ and $x_2$ for $x_a$. Thus we have

$\Sigma' y^2 = 83340.0$ ; $\Sigma' y x_1 = 8477.0$

$\Sigma' x_1^2 = 2270.1$ ; $\Sigma' y x_2 = 17329.4$

$\Sigma' x_2^2 = 10525.0$ ; $\Sigma' x_1 x_2 = 3530.1$
We first solve the simultaneous equations

$$b_1 \, \Sigma'(x_1^2) + b_2 \, \Sigma'(x_1 x_2) = 1 \qquad \text{(D)}$$

$$b_1 \, \Sigma'(x_1 x_2) + b_2 \, \Sigma'(x_2^2) = 0 \qquad \text{(E)}$$

and the solution of $b_1$ we denote by $c_{a1}$ and the solution of $b_2$ by $c_{a2}$. The b's
are, of course, the standard partial regression coefficients, and might be more
completely defined as in the previous treatment by $b_{y1.2}$ and $b_{y2.1}$. In the present
treatment we will understand the y and latter halves of the subscripts.
Substituting our values in equations (D) and (E), we get

$$c_{a1} \cdot 2270.1 + c_{a2} \cdot 3530.1 = 1 \qquad \text{(F)}$$

$$c_{a1} \cdot 3530.1 + c_{a2} \cdot 10525.0 = 0 \qquad \text{(G)}$$

which can be solved by e.g. multiplying the first (F) by 10525.0/3530.1 = 2.9816,
obtaining

$$c_{a1} \cdot 6768.0 + c_{a2} \cdot 10525.0 = 2.9816$$

From this we subtract the second equation, (G), to obtain

$$c_{a1} \cdot 3238.0 = 2.9816$$

and

$$c_{a1} = 0.9208 \times 10^{-3}$$

We can obtain $c_{a2}$ e.g. by substituting this value of $c_{a1}$ in the second equation as

$$c_{a2} = -0.3088 \times 10^{-3}$$

We now require the solutions of

$$b_1 \, \Sigma'(x_1^2) + b_2 \, \Sigma'(x_1 x_2) = 0 \qquad \text{(H)}$$

$$b_1 \, \Sigma'(x_1 x_2) + b_2 \, \Sigma'(x_2^2) = 1 \qquad \text{(I)}$$

and we will denote the solutions $b_1$ and $b_2$ by $c_{b1}$ and $c_{b2}$ respectively.

It will be noted that equations (H) and (I) are identical with (D) and (E)
except that the right hand sides have been interchanged. As before, we can
multiply the first equation (H) by 2.9816, obtaining

$$c_{b1} \cdot 6768.0 + c_{b2} \cdot 10525.0 = 0$$

from which we subtract (I).

We thus have

$$c_{b1} = -0.3088 \times 10^{-3}$$

and substituting in either (H) or (I) we get

$$c_{b2} = 0.1986 \times 10^{-3}$$

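We now can calculate $b_1$ and $b_2$ as

$$b_1 = c_{a1} \, \Sigma'(y x_1) + c_{a2} \, \Sigma'(y x_2) \qquad \text{(J)}$$

$$= 0.9208 \times 10^{-3} \times 8477.0 - 0.3088 \times 10^{-3} \times 17329.4 = 2.454$$

$$b_2 = c_{b1} \, \Sigma'(y x_1) + c_{b2} \, \Sigma'(y x_2) \qquad \text{(K)}$$

$$= -0.3088 \times 10^{-3} \times 8477.0 + 0.1986 \times 10^{-3} \times 17329.4 = 0.824$$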

These solutions for $b_1$ and $b_2$ are of course the same as obtained previously
for $b_{yw.a}$ and $b_{ya.w}$. It will be noted now where one of the principal advantages
of this procedure lies. We can easily obtain the partial regression coefficients for
a second dependent variable Y without the trouble of re-solving the simultaneous
equations. Thus in the present instance we are only concerned with one
dependent variable, the stack loss as combined nitrogen. We might in addition
be interested in the ratio of nitrous to nitric gases and the variations of this ratio
with the two independent variables. We merely need to obtain the sums of
products of the new dependent variable Y with $x_1$ and $x_2$, $\Sigma'(Y x_1)$ and $\Sigma'(Y x_2)$,
and substitute in equations (J) and (K). The additional labour is thus very slight,
especially when compared with re-solving the simultaneous equations as with the
earlier treatment.
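The reuse of the multipliers for a second dependent variable can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `partial_coefficients` is hypothetical, the sums of squares and products are the rounded figures quoted above, and the 2 x 2 system is inverted directly rather than by the elimination steps of the text.

```python
# Sums of squares and products about the means, as quoted in the text.
S_x1x1 = 2270.1    # sum of squares of x1 about its mean
S_x1x2 = 3530.1    # sum of products of x1 and x2 about their means
S_x2x2 = 10525.0   # sum of squares of x2 about its mean

# Invert the symmetric 2 x 2 matrix once; its elements are the c multipliers.
det = S_x1x1 * S_x2x2 - S_x1x2 ** 2
c_a1 = S_x2x2 / det
c_a2 = c_b1 = -S_x1x2 / det
c_b2 = S_x1x1 / det


def partial_coefficients(S_yx1, S_yx2):
    """Equations (J) and (K): b1 and b2 for any dependent variable."""
    b1 = c_a1 * S_yx1 + c_a2 * S_yx2
    b2 = c_b1 * S_yx1 + c_b2 * S_yx2
    return b1, b2


# Stack loss as the dependent variable.
b1, b2 = partial_coefficients(8477.0, 17329.4)
print(round(b1, 3), round(b2, 3))
```

For a further dependent variable only the two sums of products passed to `partial_coefficients` change; the multipliers are not recomputed.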
The second advantage of the present procedure is that we can now readily
obtain the standard error of each partial regression coefficient. We need the
residual variance $\sigma_r^2$, which we already have in Table 10.1. To repeat, in the
present symbolism, it is

$$\sigma_r^2 = \frac{1}{n-3}\left[\Sigma'(y^2) - b_1 \Sigma'(y x_1) - b_2 \Sigma'(y x_2)\right] = 354.9$$

Then the standard error of $b_1$ is

$$\sigma_r \sqrt{c_{a1}} = \sqrt{354.9} \times \sqrt{0.9208 \times 10^{-3}} = 0.574$$

and the standard error of $b_2$ is

$$\sigma_r \sqrt{c_{b2}} = \sqrt{354.9} \times \sqrt{0.1986 \times 10^{-3}} = 0.266$$

These standard errors have (n - 3) degrees of freedom, and since
n is generally large any regression coefficient greater than twice its own standard
error can be assumed significant at the 5% level. This procedure is thus a method of testing
significance alternative to that set out previously in Table 10.1. We can also set
the probable limits (95%) for $b_1$ as 2.454 ± 2 × 0.574, i.e. 1.306 and 3.602. Putting
it this way makes it clear how many significant figures are worth retaining in the
final statement of the regression equation.


(c) The Need for Multiple Regression and Partial Correlation

In deriving the regression coefficients $b_{yw.a}$ and $b_{ya.w}$, we have kept the
variable in the subscript after the full stop constant. For this reason these are
known as "partial" regression coefficients. The simple regression coefficients
would have been given by

$$b_{yw} = \frac{\Sigma' y x_w}{\Sigma' x_w^2} = \frac{8477}{2270} = 3.734$$

$$b_{ya} = \frac{\Sigma' y x_a}{\Sigma' x_a^2} = \frac{17329}{10525} = 1.646$$


It  will  be  noted  that  these  simple  regression  coefficients  are  appreciably 
larger  than  the  corresponding  partial  regression  coefficients.  The  explanation  of 
this  is  easy  to  see.     If  we  calculate  the  correlation  coefficient  of  xw  and  xa  we  get 

$$r_{wa} = \frac{\Sigma' x_w x_a}{\sqrt{\Sigma' x_w^2 \times \Sigma' x_a^2}} = \frac{3530}{\sqrt{2270 \times 10525}} = 0.7222$$

There  is  thus  a  highly  significant  and  very  important  correlation  between  xw  and 
xa.  This  is  a  consequence  of  the  nature  of  the  process,  of  course.  Weather 
conditions  that  lead  to  low  air  temperatures  (xa)  will  also  tend  to  give  low  cooling 
water  temperature  (xw). 

Thus  when  we  calculate  the  simple  regression  coefficient  of  y  upon  xw,  since 
the  higher  figures  for  xw  will  tend  to  be  associated  with  the  higher  figures  for 
xa,  the  values  of  y  for  the  larger  values  of  xw  will  tend  to  be  larger  than  they  should 
be  because  they  are  also  associated  with  the  larger  values  of  xa.  The  partial 
regression  coefficients,  however,  have  the  effect  of  the  other  variable  eliminated, 
and  measure  only  the  effect  of  the  variable  specified. 
In this connection the partial correlation coefficients are sometimes used. If
the simple correlation coefficients are

$$r_{wa} = 0.7222$$

$$r_{yw} = \frac{\Sigma' y x_w}{\sqrt{\Sigma' y^2 \times \Sigma' x_w^2}} = \frac{8477}{\sqrt{83{,}340 \times 2270}} = 0.6165$$

$$r_{ya} = \frac{\Sigma' y x_a}{\sqrt{\Sigma' y^2 \times \Sigma' x_a^2}} = \frac{17{,}329}{\sqrt{83{,}340 \times 10{,}525}} = 0.5850$$

then the partial correlation coefficients are

$$r_{yw.a} = \frac{r_{yw} - r_{ya}\, r_{wa}}{\sqrt{(1 - r_{ya}^2)(1 - r_{wa}^2)}} = \frac{0.6165 - 0.5850 \times 0.7222}{\sqrt{(1 - 0.5850^2)(1 - 0.7222^2)}} = 0.3458$$

and

$$r_{ya.w} = \frac{r_{ya} - r_{yw}\, r_{aw}}{\sqrt{(1 - r_{yw}^2)(1 - r_{aw}^2)}} = \frac{0.5850 - 0.6165 \times 0.7222}{\sqrt{(1 - 0.6165^2)(1 - 0.7222^2)}} = 0.2567$$

For  testing  the  significance  of  a  simple  correlation  coefficient  the  degrees  of 
freedom  are  (n  —  2),  if  n  is  the  number  of  observations.  For  a  partial  correlation 
coefficient,  another  degree  of  freedom  is  lost,  and  the  number  is  thus  (n  —  3). 
In  the  present  case,  though  both  the  correlation  coefficients  are  markedly  reduced 
in their transformation from simple to partial coefficients, they still remain
significant with this large number of degrees of freedom (139 - 3).  In general,
however,  it  is  possible  for  a  positive  simple  correlation  coefficient  to  be  transformed 
into  a  negative  partial  correlation  coefficient,  if,  to  quote  the  formula  for  rya.w,  the 
product  of  ryw  and  raw  is  greater  than  rya. 
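The partial correlation formula can be checked with a short Python sketch. The function name `partial_corr` and its argument order are illustrative assumptions; the three simple coefficients are those quoted in the text. The last lines use made-up coefficients (not from the data) to show the change of sign mentioned above.

```python
from math import sqrt


def partial_corr(r_xy, r_xz, r_yz):
    """Correlation of x and y with the third variable z held constant."""
    return (r_xy - r_xz * r_yz) / sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


r_yw, r_ya, r_wa = 0.6165, 0.5850, 0.7222

r_yw_a = partial_corr(r_yw, r_ya, r_wa)   # y with w, a held constant
r_ya_w = partial_corr(r_ya, r_yw, r_wa)   # y with a, w held constant
print(round(r_yw_a, 4), round(r_ya_w, 4))

# Hypothetical figures only: a positive simple coefficient (0.30) becomes a
# negative partial coefficient because 0.80 x 0.80 exceeds 0.30.
r_hypothetical = partial_corr(0.30, 0.80, 0.80)
print(r_hypothetical < 0)
```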

While the difference between the simple and partial coefficients was in this
case of interest but not of immediate practical importance, it might well be
otherwise in other cases. Thus suppose it had been decided that it would be economic
to instal refrigeration equipment on the basis of the simple regression coefficient
of 0.373. If the refrigeration would lower the mean cooling water temperature
by 10°C, we could expect a reduction in the stack loss of 3.73 units on the simple
regression, and this might be sufficient to justify the installation. The reduction
which would be actually achieved is 2.44 units (from the partial regression
coefficient), and the difference between this and 3.73 might be the difference between
an economic and an uneconomic installation.

(d)  Multiple  Correlation  with  Three  Independent  Variables 

In  the  example  we  have  been  considering,  the  loss  from  a  P.O.P.  stack,  there 
is  a  third  variable  of  considerable  interest,  namely  the  strength  of  acid  in  the 
absorption tower, which we will denote by $x_s$.

We are setting out to determine the coefficients (a and the b's) in the equation

$$y = a_{y.was} + b_{yw.as}\, x_w + b_{ya.ws}\, x_a + b_{ys.aw}\, x_s$$

where the symbolism is as before: thus

$a_{y.was}$ is the value y would have if $x_w$, $x_a$ and $x_s$ were held constant at their
mean values.

$b_{yw.as}$ measures the rate of increase of y with increasing $x_w$, both $x_a$ and $x_s$
being constant at their mean values.

Similarly for $b_{ya.ws}$ and $b_{ys.aw}$.



We require the sums, the sums of squares, and the sums of products with
the other three variables. As it is desirable to retain rather more decimal places,
the complete set, with the earlier ones recalculated, is given below:

$\Sigma y$   = 1479      $\Sigma y^2$   = 99077      $\Sigma' y^2$   = 83340.014
$\Sigma x_w$ = 406       $\Sigma x_w^2$ = 3456       $\Sigma' x_w^2$ = 2270.12950
$\Sigma x_a$ = 1         $\Sigma x_a^2$ = 10525      $\Sigma' x_a^2$ = 10524.9928
$\Sigma x_s$ = 70        $\Sigma x_s^2$ = 36364      $\Sigma' x_s^2$ = 36328.74820

$\Sigma y x_w$   = 12797      $\Sigma' y x_w$   = 8477.0431
$\Sigma y x_a$   = 17340      $\Sigma' y x_a$   = 17329.3597
$\Sigma y x_s$   = 12931      $\Sigma' y x_s$   = 12186.1799
$\Sigma x_w x_a$ = 3533       $\Sigma' x_w x_a$ = 3530.07914
$\Sigma x_a x_s$ = 5365       $\Sigma' x_a x_s$ = 5364.49640
$\Sigma x_s x_w$ = 1943       $\Sigma' x_s x_w$ = 1738.5396
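The sums about the means follow from the crude sums by subtracting the product of the two totals divided by the number of observations. The sketch below recomputes them in Python; it assumes n = 139 observations, as quoted earlier in the text, and the dictionary layout and the name `about_mean` are illustrative only.

```python
n = 139  # number of observations quoted in the text

# Totals of each variable.
sums = {"y": 1479, "w": 406, "a": 1, "s": 70}

# Crude sums of squares and products.
raw = {
    ("y", "y"): 99077, ("w", "w"): 3456, ("a", "a"): 10525, ("s", "s"): 36364,
    ("y", "w"): 12797, ("y", "a"): 17340, ("y", "s"): 12931,
    ("w", "a"): 3533, ("a", "s"): 5365, ("s", "w"): 1943,
}


def about_mean(u, v):
    """Crude sum of products less (total of u) x (total of v) / n."""
    return raw[(u, v)] - sums[u] * sums[v] / n


corrected = {pair: about_mean(*pair) for pair in raw}
for pair, value in corrected.items():
    print(pair, round(value, 4))
```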

We now need to solve the three sets of three simultaneous equations (A),
(B) and (C) below. The first set has as right hand sides for (A), (B) and (C)
1, 0 and 0 respectively, and the solutions are $c_{a1}$, $c_{b1}$ and $c_{c1}$. The second set
has as right hand sides 0, 1 and 0 and solutions $c_{a2}$, $c_{b2}$ and $c_{c2}$. Similarly the
third set has right hand sides 0, 0, 1 and solutions $c_{a3}$, $c_{b3}$ and $c_{c3}$.

$$c_a \Sigma' x_w^2 + c_b \Sigma' x_w x_s + c_c \Sigma' x_w x_a = 1,\ 0,\ 0 \qquad \text{(A)}$$

$$c_a \Sigma' x_s x_w + c_b \Sigma' x_s^2 + c_c \Sigma' x_s x_a = 0,\ 1,\ 0 \qquad \text{(B)}$$

$$c_a \Sigma' x_a x_w + c_b \Sigma' x_a x_s + c_c \Sigma' x_a^2 = 0,\ 0,\ 1 \qquad \text{(C)}$$

The  solutions  of  these  equations  can  be  obtained  empirically,  but  a  systematic 
method  is  less  liable  to  accidental  errors. 

Whittaker and Robinson (1) give a general discussion, but the most
satisfactory method is due to M. H. Doolittle. We set out the equations as (A'),
(B') and (C') in a table as Table 10.2. Inserting the numerical values quoted
above, we get (A"), (B"), (C"). It will be understood that all the figures occurring
in the column headed $c_a$ are effectively the coefficients of $c_a$ in the successive
equations, and similarly for $c_b$ and $c_c$. The right hand sides have been multiplied
by $10^4$ temporarily to avoid the occurrence of excessively large numbers of 0's
after the decimal points.

The next step is to write down again (A") as (A'''), and then put underneath
it the result of dividing it by the coefficient of $c_a$ with the sign changed (here by
-2270.12950). This gives us (D).

Now set down (B") again as (B'''). Multiply (A''') by the coefficient of $c_b$
in (D) (here -0.76583279); this is (E). Add to (B''') giving (F). Divide (F)
by the coefficient of its first term with the sign changed (i.e. by -34997.3536)
to give (G).

Now set down (C") again as (C'''). Multiply (A''') by the coefficient of
$c_c$ in (D) (i.e. by -1.55501223) to give (H). Also multiply (F) by the coefficient
of $c_c$ in (G) to give (I). Adding (C'''), (H) and (I) we get (J), and dividing through
by the coefficient of $c_c$ (4833.3424) we get (K). The three figures on the right
hand side are the solutions for $c_{c1}$, $c_{c2}$ and $c_{c3}$, all multiplied by $10^4$.

If we now take (G), and substitute these solutions for $c_{c1}$, $c_{c2}$ and $c_{c3}$, we
get (L1), (L2) and (L3) respectively. These lead immediately to (L'1), (L'2) and
(L'3), which give the solutions for $c_{b1}$, $c_{b2}$ and $c_{b3}$ respectively.

We now substitute the solutions for $c_b$ and $c_c$ in (D), getting (M1), (M2) and
(M3), leading to (M'1), (M'2) and (M'3) as the solutions of $c_{a1}$, $c_{a2}$ and $c_{a3}$ respectively.

We  can  summarise  these  solutions  in  a  matrix  as  in  Table  10.3. 
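For readers with a computer to hand, the three sets of equations can be solved together by inverting the matrix of sums of squares and products. The sketch below uses plain Gauss-Jordan elimination, which is not the Doolittle layout described above but leads to the same multipliers; the function name `invert` is hypothetical, and the figures are the sums about the means quoted earlier, in the order w, s, a.

```python
def invert(matrix):
    """Gauss-Jordan elimination on [matrix | identity].

    No pivoting is attempted, which is adequate for a positive-definite
    matrix of sums of squares and products such as this one.
    """
    n = len(matrix)
    aug = [list(map(float, row)) + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = aug[col][col]
        aug[col] = [v / pivot for v in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [v - factor * p for v, p in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


# Rows and columns in the order x_w, x_s, x_a, as in equations (A), (B), (C).
S = [[2270.12950, 1738.5396, 3530.07914],
     [1738.5396, 36328.7482, 5364.4964],
     [3530.07914, 5364.4964, 10524.9928]]

C = invert(S)
C_times_1e4 = [[1e4 * v for v in row] for row in C]
for row in C_times_1e4:
    print([round(v, 4) for v in row])
```

The printed rows can be compared with the hand-computed multipliers; small differences in the later decimal places are to be expected from rounding in the hand calculation.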

(1) E. T. Whittaker and G. Robinson, "The Calculus of Observations" (Blackie),
3rd edition, Chapter IX.
Table 10.2

Row       $c_a$            $c_b$            $c_c$            First Set      Second Set     Third Set
                                                             x $10^4$       x $10^4$       x $10^4$

(A')      $\Sigma' x_w^2$      $\Sigma' x_w x_s$    $\Sigma' x_w x_a$    10,000         0              0
(B')      $\Sigma' x_s x_w$    $\Sigma' x_s^2$      $\Sigma' x_s x_a$    0              10,000         0
(C')      $\Sigma' x_a x_w$    $\Sigma' x_a x_s$    $\Sigma' x_a^2$      0              0              10,000

(A")      2270.12950       1738.5396        3530.07914       10,000         0              0
(B")      1738.5396        36328.7482       5364.4960        0              10,000         0
(C")      3530.07914       5364.4960        10524.9928       0              0              10,000

(A''')    2270.12950       1738.5396        3530.07914       10,000         0              0
(D)       -1.00000         -0.76583279      -1.55501223      -4.4050351     0              0
(B''')    1738.5396        36328.7842       5364.4960        0              10,000         0
(E)       -1738.5396       -1331.4306       -2703.4503       -7658.3279     0              0

(F)       0.0000           34997.3536       2661.0457        -7658.3279     10,000         0
(G)                        -1.0000          -0.076035630     0.218825915    -0.28573589    0
(C''')    3530.07914       5364.4960        10524.9928       0              0              10,000
(H)       -3530.07914      -2703.4502       -5489.3161       -15550.1223    0              0
(I)                        -2661.0458       -202.3343        582.3058       -760.35630     0

(J)       0.0000           0.0000           4833.3424        -14967.8165    -760.35630     10,000
(K)                                         1.00000          -3.0967838     -0.1573148     2.0689617

(L1)                       -1.0000          0.2354659        0.2188259
(L2)                       -1.0000          0.0119615                       -0.2857359
(L3)                       -1.0000          -0.1573148                                     0.0

(L'1)                      1.0000                            0.0166400
(L'2)                      1.0000                                           0.2976974
(L'3)                      1.0000                                                          -0.1573148

(M1)      -1.0000          -0.0127435       4.8155366        -4.4050351
(M2)      -1.0000          -0.2279864       0.2446261                       0
(M3)      -1.0000          0.1204768        -3.2172607                                     0

(M'1)     1.0000                                             9.2078282
(M'2)     1.0000                                                            0.0166397
(M'3)     1.0000                                                                           -3.0967839
Table 10.3
All solutions multiplied by $10^4$

          a               b               c

1         9.2078282       0.0166400       -3.0967838
2         0.0166397       0.2976974       -0.1573148
3         -3.0967839      -0.1573148      2.0689617

Like the original equations, this matrix is axi-symmetric: $c_{b1} = c_{a2}$,
$c_{c1} = c_{a3}$, and $c_{c2} = c_{b3}$.

A good check on the correctness of the solutions is to take the set for $c_a$ and
substitute them in (C"). There we get for the left hand side:

$$(3530.07914 \times 9.2078282 + 5364.4960 \times 0.0166397 - 10524.9928 \times 3.0967839) \times 10^{-4} = 0.00000025$$

which is close enough to the expected value of 0.

We now have as our values for the regression coefficients

$$b_{w.as} = c_{a1} \Sigma'(y x_w) + c_{a2} \Sigma'(y x_s) + c_{a3} \Sigma'(y x_a)$$

$$= [9.2078282 \times 8477.0431 + 0.0166397 \times 12186.1799 - 3.0967839 \times 17329.3597] \times 10^{-4}$$

$$= [78055.1564 + 202.7744 - 53665.2795] \times 10^{-4} = 2.4592651$$

$$b_{s.aw} = c_{b1} \Sigma'(y x_w) + c_{b2} \Sigma'(y x_s) + c_{b3} \Sigma'(y x_a) = 0.1042687$$

$$b_{a.sw} = c_{c1} \Sigma'(y x_w) + c_{c2} \Sigma'(y x_s) + c_{c3} \Sigma'(y x_a) = 0.7685143$$

A useful check on the accuracy of these last proceedings is to substitute these
values in the equation

$$b_{w.as} \Sigma' x_a x_w + b_{s.aw} \Sigma' x_a x_s + b_{a.sw} \Sigma' x_a^2 = \Sigma' y x_a$$

Here the left hand side comes to 17329.3566 against 17329.3597, a discrepancy of
0.0031 which is small enough to be ignored.
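The same arithmetic can be sketched in Python, taking the multipliers from Table 10.3 and the sums of products about the means quoted earlier. The variable names are illustrative assumptions, and the order of variables is w, s, a throughout.

```python
# Multipliers from Table 10.3 (all x 10^4), in the order w, s, a.
c_a = (9.2078282, 0.0166397, -3.0967839)
c_b = (0.0166400, 0.2976974, -0.1573148)
c_c = (-3.0967838, -0.1573148, 2.0689617)

# Sums of products of y with x_w, x_s, x_a about the means.
S_y = (8477.0431, 12186.1799, 17329.3597)


def coefficient(multipliers):
    """Sum of multiplier x sum of products, scaled back by 10^-4."""
    return sum(c * s for c, s in zip(multipliers, S_y)) * 1e-4


b_w = coefficient(c_a)
b_s = coefficient(c_b)
b_a = coefficient(c_c)
print(round(b_w, 7), round(b_s, 7), round(b_a, 7))

# Check: substitute back into the normal equation for x_a.
lhs = b_w * 3530.07914 + b_s * 5364.4960 + b_a * 10524.9928
print(round(lhs, 4), "against", S_y[2])
```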

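The residual variance is given by

$$\sigma_r^2 = \frac{1}{n-4}\left[\Sigma' y^2 - b_{w.as} \Sigma' y x_w - b_{s.aw} \Sigma' y x_s - b_{a.sw} \Sigma' y x_a\right]$$

$$= \frac{1}{139-4}\left[83340.014 - 2.4592651 \times 8477.0431 - 0.1042687 \times 12186.1799 - 0.7685143 \times 17329.3597\right]$$

$$= 47904.221/135 = 354.846$$

The standard errors of the three regression coefficients are then given by

$$\sigma_{w.as} = \sigma_r \sqrt{c_{a1}} = \sqrt{354.847 \times 9.2078 \times 10^{-4}} = 0.5716$$

$$\sigma_{s.aw} = \sigma_r \sqrt{c_{b2}} = 0.1028$$

$$\sigma_{a.sw} = \sigma_r \sqrt{c_{c3}} = 0.2709$$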
To test the significance of each coefficient, we obtain the value of t as the ratio
of the regression coefficient to its standard error, and look up this value of t with
degrees of freedom equal to those of the residual variance. For $b_{a.sw}$, for example,
t = 0.7685143/0.2709 = 2.837 with 135 degrees of freedom. Reference to
Table I in the Appendix shows that this corresponds to a level of significance of
between 1% and 0.1%.
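The residual variance, the standard errors and the t ratios can be recomputed as follows. This is a sketch only: the dictionary names are assumptions, the coefficients and diagonal multipliers are those quoted in the text, and the significance level still has to be read from a table of t.

```python
from math import sqrt

n, k = 139, 3                      # observations, independent variables
S_yy = 83340.014                   # sum of squares of y about its mean

b = {"w": 2.4592651, "s": 0.1042687, "a": 0.7685143}
S_y = {"w": 8477.0431, "s": 12186.1799, "a": 17329.3597}
# Diagonal multipliers c_a1, c_b2, c_c3 from Table 10.3, scaled by 10^-4.
c_diag = {"w": 9.2078282e-4, "s": 0.2976974e-4, "a": 2.0689617e-4}

resid_ss = S_yy - sum(b[v] * S_y[v] for v in b)
resid_var = resid_ss / (n - k - 1)          # 135 degrees of freedom

se = {v: sqrt(resid_var * c_diag[v]) for v in b}
t = {v: b[v] / se[v] for v in b}

print(round(resid_var, 3))
for v in b:
    print(v, round(se[v], 4), round(t[v], 3))
```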
Confidence limits for each estimate can be derived in the usual way. For
$b_{a.sw}$ and 95% limits these are 0.768514 ± 1.98 × 0.2709, or between 0.232132
and 1.304896.

Incidentally,  it  will  be  obvious  that  we  have  retained  an  excessive  number 
of  significant  figures,  and  we  could  comfortably  discard  the  last  4.  It  is  much 
better,  however,  to  carry  out  this  discarding  at  the  very  end  of  the  process  rather 
than  at  the  beginning. 

The final regression equation is

$$y - \bar{y} = b_{w.as}(x_w - \bar{x}_w) + b_{s.aw}(x_s - \bar{x}_s) + b_{a.sw}(x_a - \bar{x}_a)$$

$$y - 1479/139 = 2.459265\,(x_w - 406/139) + 0.104269\,(x_s - 70/139) + 0.768514\,(x_a - 1/139)$$

or

$$y = 2.459\, x_w + 0.104\, x_s + 0.768\, x_a + 3.399$$
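The constant term of the final equation is the mean of y less the coefficients multiplied by the means of the independent variables. A minimal Python sketch, assuming n = 139 and using an illustrative function name `predict`, is:

```python
n = 139
means = {"y": 1479 / n, "w": 406 / n, "s": 70 / n, "a": 1 / n}
b = {"w": 2.459265, "s": 0.104269, "a": 0.768514}

# Constant term: mean of y less sum of coefficient x mean.
intercept = means["y"] - sum(b[v] * means[v] for v in b)


def predict(x_w, x_s, x_a):
    """Stack loss estimated from the fitted regression equation."""
    return intercept + b["w"] * x_w + b["s"] * x_s + b["a"] * x_a


print(round(intercept, 3))
```

As the text goes on to stress, such a function should only be used within the ranges of the independent variables that occurred in the data.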

(e)  Conclusions 

The use of this multiple regression technique involves rather heavy
arithmetic, but when we have a mass of data with several independent variables it is
the only sound way of assessing their several effects.

Thus in the present instance, we were particularly interested in the
independent variable with the smallest regression coefficient, $x_s$. Completely
eliminating the effects of the other two independent variables we obtained an estimate
of the regression coefficient and were able to test its significance against a residual
of 354.8, against which it was quite non-significant. If we had ignored the other
two variables we would have obtained the sum of squares due to $x_s$ as 4088, with
a residual of 79252/137 = 578.5. Tested against this $x_s$ would appear significant
at the 1% level, an entirely erroneous conclusion.
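The figures for the simple regression on $x_s$ alone can be reproduced with the short sketch below. The variable names are illustrative, and the variance ratio still has to be compared with a table of the variance ratio at 1 and 137 degrees of freedom.

```python
n = 139
S_yy = 83340.014      # sum of squares of y about its mean
S_ys = 12186.1799     # sum of products of y and x_s about their means
S_ss = 36328.7482     # sum of squares of x_s about its mean

ss_regression = S_ys ** 2 / S_ss                 # sum of squares due to x_s alone
residual_ms = (S_yy - ss_regression) / (n - 2)   # 137 degrees of freedom
variance_ratio = ss_regression / residual_ms

print(round(ss_regression), round(residual_ms, 1), round(variance_ratio, 2))
```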

The  second  reason  for  using  the  multiple  regression  technique  was  discussed 
at  length  earlier  with  reference  to  the  misleading  results  obtained  when  using 
simple  regression  when  the  independent  variables  happen  to  be  correlated.  In 
this  case  the  independent  variables  both  had  positive  regression  coefficients  and 
were  positively  correlated,  and  so  gave  fictitiously  large  values  for  both  the 
regression  coefficients.  If,  however,  they  were  negatively  correlated,  we  would 
obtain  fictitiously  small  values  for  the  regression  coefficients  using  the  simple 
regression  technique.  Of  course,  if  the  independent  variables  are  not  correlated, 
then  this  phenomenon  will  not  occur,  and  the  only  loss  through  using  simple 
regression  instead  of  partial  (or  multiple)  regression  is  the  loss  of  precision  in 
our  estimate  through  the  residual  being  much  larger. 

As  with  simple  regression  equations,  extrapolation  of  a  multiple  regression 
equation  is  unwise  unless  there  is  a  very  sound  theoretical  basis.  It  is  good 
practice  to  quote  the  ranges  of  the  independent  variables  that  have  occurred  in 
the  data  used  for  calculating  the  regression  equation,  in  order  to  discourage 
extrapolation. 


CHAPTER  XI 
THE  GENERAL  ANALYSIS  OF  VARIANCE 

(a)  Introduction 

It  is  important  to  realize  that  a  situation  suitable  for  the  analysis  of  variance 
may  arise  in  several  ways : — 

(i)  By  accident :  the  process  may  have  a  suitable  structure  as  it  stands.  An 
example  of  this  is  discussed  in  Section  (i)  of  this  chapter,  where  there  are  three 
factories  each  handling  three  types  of  weak  acid.  The  statistician  had  no  part 
in  seeing  that  there  were  three  factories  for  each  type  of  weak  acid  :  this  happened 
quite  fortuitously. 
(ii) By slight modification of existing manufacturing procedure. Thus it
may be that in a process a variable amount of rework material of uncertain
history is added to each unit, and for the purpose of examination this practice
could be discontinued temporarily.

(iii)  By  deliberate  experiment.  Thus  several  factors  may  be  varied  at  the 
levels  appropriate  to  a  factorial  design. 

(b)  Types  of  Analysis 

One of the difficulties frequently experienced in the use of the analysis of
variance is to decide which is the appropriate form of analysis. A further
difficulty is deciding the procedure for tests of significance. An attempt will be made
here to give an outline of a general treatment.

In  the  succeeding  examples,  we  shall  deal  with  a  hypothetical  polymer  being 
processed  into  fibres  and  the  dependent  variable  is  the  fibre  tensile  strength. 
We  shall  make  various  modifications  to  this  basic  idea  in  an  attempt  to  show 
how the various forms of analysis of variance are related to each other.  After
each  form  of  this  general  example  we  shall  deal  in  detail  with  another  example 
of  similar  structure.  A  number  of  aspects  of  the  use  of  the  analysis  of  variance 
follow  in  Chapter  XII. 

All  analyses  can  be  divided  into  two  broad  types : — 

(i)  Those  in  which  categories  preserve  their  one-to-one  correspondence : 
these  we  can  define  as  pure  factorial  analyses. 

(ii)  Those  in  which  certain  of  the  categories  do  not  preserve  their  one-to-one 
correspondence.     These  we  can  define  as  incomplete  analyses. 

The  meaning  of  these  two  classifications  will  become  clear  with  detailed 
discussion. 

(c)  The  Two  Factor  Analysis 

We  can  frequently  categorise  a  set  of  individuals  in  two  ways  :  thus  we  can 
have  the  raw  material  classified  into  a  number  of  grades  g  according  to  the  nominal 
degree  of  polymerisation.  There  are  available  a  number  of  batches  of  each  grade, 
and  one  batch  of  each  grade  can  be  processed  at  each  of  t  different  temperatures. 
We  then  have  two  factors :  grade  of  polymer  with  g  levels  and  temperature  of 
processing  with  t  levels.  If  we  specify  the  values  of  g  and  t  for  any  individual, 
we  have  completely  identified  that  individual. 

Such  an  example  was  discussed  in  Chapter  VII  (e).  Table  11.1  below  gives 
the  degrees  of  freedom  and  components  of  variance. 

Table 11.1

Source of        Degrees of          Components
Variance         Freedom             of Variance

G                g - 1               $t\sigma_g^2 + \sigma_0^2$
T                t - 1               $g\sigma_t^2 + \sigma_0^2$
Residual         (g - 1)(t - 1)      $\sigma_0^2$

Total            gt - 1

The tests of significance are obvious. If factor G is to exist, i.e. if $\sigma_g^2$ is to
be greater than zero, then $t\sigma_g^2 + \sigma_0^2$ must be greater than $\sigma_0^2$,
and we can readily check the significance of this with the variance ratio test.
Factor T can be tested similarly.
(d) The Three Factor Analysis

(i) In the previous section, we had g grades of polymer, one batch of each
grade being processed at each of t different temperatures. Suppose now that
each of s suppliers supplies g grades of polymer and from each grade of each supplier
t batches are processed, one at each of t temperatures.

It  is  important  that  the  first  grade  from  Supplier  1  has  the  same  nominal 
degree  of  polymerisation  as  the  first  grade  from  Supplier  2,  and  as  the  first  grade 
from  Supplier  3,  etc.  Thus,  the  first  grades  for  all  the  suppliers  have  a  certain 
common  property  which  distinguishes  them  in  some  way  from  the  second  grades 
of  all  the  suppliers,  etc. 

There  are  now  gts  individuals,  and  each  can  be  uniquely  determined  by 
specifying  particular  values  of  g,  t,  and  s.  We  thus  have  three  factors  :  suppliers, 
grades  of  polymer  and  temperature  of  processing.  The  technique  of  the  analysis 
of  variance  for  three  factors  is  given  later.  The  degrees  of  freedom  and  the 
components  of  variance  are  reproduced  in  Table  11.2. 

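Table 11.2

Source of        Degrees of                Components of Variance
Variance         Freedom

G                g - 1                     $\sigma_0^2 + s\sigma_{gt}^2 + t\sigma_{gs}^2 + ts\sigma_g^2$
T                t - 1                     $\sigma_0^2 + g\sigma_{ts}^2 + s\sigma_{gt}^2 + sg\sigma_t^2$
S                s - 1                     $\sigma_0^2 + t\sigma_{sg}^2 + g\sigma_{st}^2 + gt\sigma_s^2$
G x T            (g - 1)(t - 1)            $\sigma_0^2 + s\sigma_{gt}^2$
T x S            (t - 1)(s - 1)            $\sigma_0^2 + g\sigma_{st}^2$
S x G            (s - 1)(g - 1)            $\sigma_0^2 + t\sigma_{sg}^2$
Residual         (g - 1)(t - 1)(s - 1)     $\sigma_0^2$

Total            gts - 1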

For the tests of significance we must start with the smallest interaction and
test it against the residual. Thus, if the smallest interaction is, say, S x G, we
test with the variance ratio test $(\sigma_0^2 + t\sigma_{sg}^2)$ against $\sigma_0^2$. If it is not significant,
we pool it with the residual and obtain a new estimate of $\sigma_0^2$; similarly with
the other interactions, taking them in ascending order of magnitude.

For the testing of the main effects, there are a number of possibilities:

(i) All three interactions are non-significant. Striking them out in the table
of components of variance in Table 11.2, and pooling their degrees of freedom
with the residual, we obtain Table 11.3:

Table 11.3

Source of        Degrees of                Components
Variance         Freedom                   of Variance

G                g - 1                     $\sigma_0^2 + ts\sigma_g^2$
T                t - 1                     $\sigma_0^2 + sg\sigma_t^2$
S                s - 1                     $\sigma_0^2 + gt\sigma_s^2$
Residual         gts - (g + t + s) + 2     $\sigma_0^2$

Total            gts - 1
For the three main effects the tests of significance are quite obviously to
compare $(\sigma_0^2 + ts\sigma_g^2)$ with $\sigma_0^2$ for the factor G, and similarly for T and S.

(ii) One interaction, say, G x T, is significant, and the others are not.
Striking the latter out in Table 11.2, we thereby obtain Table 11.4:

Table 11.4

Source of        Degrees of                Components of
Variance         Freedom                   Variance

G                g - 1                     $\sigma_0^2 + s\sigma_{gt}^2 + ts\sigma_g^2$
T                t - 1                     $\sigma_0^2 + s\sigma_{gt}^2 + sg\sigma_t^2$
S                s - 1                     $\sigma_0^2 + gt\sigma_s^2$
G x T            (g - 1)(t - 1)            $\sigma_0^2 + s\sigma_{gt}^2$
Residual         (gt - 1)(s - 1)           $\sigma_0^2$

Total            gts - 1

To test for the existence of the S main effect, we need to demonstrate the
existence of $\sigma_s^2$, and this can be rigorously done by testing $(\sigma_0^2 + gt\sigma_s^2)$ against
$\sigma_0^2$. For the G main effect, we need to demonstrate the existence of $\sigma_g^2$, and to
do this, we must show that

$\sigma_0^2 + s\sigma_{gt}^2 + ts\sigma_g^2$ is greater than $\sigma_0^2 + s\sigma_{gt}^2$,

i.e. we test the G main effect against the G x T interaction. The T main effect
is tested similarly.

(iii) The third possibility is that two interactions are significant, e.g. G x T
and T x S. Table 11.2 then becomes Table 11.5:

Table 11.5

Source of        Degrees of                Components of
Variance         Freedom                   Variance

G                g - 1                     $\sigma_0^2 + s\sigma_{gt}^2 + ts\sigma_g^2$
T                t - 1                     $\sigma_0^2 + g\sigma_{ts}^2 + s\sigma_{gt}^2 + sg\sigma_t^2$
S                s - 1                     $\sigma_0^2 + g\sigma_{st}^2 + gt\sigma_s^2$
G x T            (g - 1)(t - 1)            $\sigma_0^2 + s\sigma_{gt}^2$
T x S            (t - 1)(s - 1)            $\sigma_0^2 + g\sigma_{st}^2$
Residual         t(g - 1)(s - 1)           $\sigma_0^2$

Total            gts - 1

Studying the Components of Variance, it will be apparent that the significance
of the G and S main effects can be established by testing their mean squares
against the G x T and the T x S interactions respectively. However, there is
no rigorous test to demonstrate the existence of the T main effect, for while we
might prove that it was greater than the G x T interaction, this might be due
to the existence of $\sigma_{ts}^2$ as much as to the existence of $\sigma_t^2$.

In practice this impasse is not of any great practical importance, as we would
break down the analysis into separate two factor G x S analyses for each level
of T. This procedure is illustrated in Section (e) of this chapter.
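The rule for choosing the mean square against which each main effect is tested can be put as a small Python function. This is a sketch of the reasoning in cases (i) to (iii) only; the function name `error_term` and the representation of interactions as pairs of letters are assumptions made for illustration.

```python
def error_term(main, significant_interactions):
    """Return the mean square against which a main effect is tested.

    `main` is a factor letter such as "G"; `significant_interactions` is a
    list of pairs such as ("G", "T") for the interactions found significant
    against the residual. Non-significant interactions are taken as pooled
    with the residual.
    """
    involved = [pair for pair in significant_interactions if main in pair]
    if not involved:
        return "Residual"                 # cases such as S in Table 11.4
    if len(involved) == 1:
        return " x ".join(involved[0])    # e.g. G against G x T
    return None                           # no rigorous test, e.g. T in Table 11.5


# Case (iii): G x T and T x S both significant.
significant = [("G", "T"), ("T", "S")]
for factor in ("G", "T", "S"):
    print(factor, "->", error_term(factor, significant))
```

When the function returns `None` the analysis would be broken down into separate two factor analyses, as described above.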
Let us consider the data in the table below, referring to a purification process.
The figures (the dependent variable x) refer to the purity of a product measured
in a certain way. The factor T at three levels represents time of boiling. The
factor A represents solvent used, and $W_1$ represents a cold final wash of the product
and $W_2$ a hot final wash.

                  T1        T2        T3

A1      W1        3.1       2.0       1.3
        W2        2.1       1.6       1.9

A2      W1        4.3       2.8       4.1
        W2        6.1       3.9       2.6

In  analysing  this  data,  an  arbitrary  zero  of  3.0  was  taken.  This  reduces 
the  magnitude  of  all  the  numbers,  and  therefore  their  squares,  and  the  arithmetic 
becomes  less  burdensome.  Also,  to  avoid  confusion  with  the  decimal  point,  the 
numbers  were  then  multiplied  by  10.  This  results  in  the  sums  of  squares  in  the 
analysis of variance being $10^2$ = 100 times too large, which does not affect the
analysis,  as  in  the  tests  of  significance  all  the  terms  are  compared  with  each  other, 
the  change  in  scale  thus  being  of  no  importance.  If  we  proceed  to  calculate  any 
particular  variance,  however,  we  must  correct  for  this  change  of  scales  by  dividing 
by  100. 

The  result  of  the  transformation  is  as  below : 


                  T1        T2        T3

A1      W1        1         -10       -17
        W2        -9        -14       -11

A2      W1        13        -2        11
        W2        31        9         -4
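The coding by an arbitrary zero and a scale factor can be sketched as below. The dictionary layout is an assumption made for illustration; the arbitrary zero of 3.0 and the multiplier of 10 are those stated in the text.

```python
# Purity figures keyed by (solvent, wash); each tuple runs over T1, T2, T3.
purity = {
    ("A1", "W1"): (3.1, 2.0, 1.3),
    ("A1", "W2"): (2.1, 1.6, 1.9),
    ("A2", "W1"): (4.3, 2.8, 4.1),
    ("A2", "W2"): (6.1, 3.9, 2.6),
}

ZERO, SCALE = 3.0, 10

# Subtract the arbitrary zero, multiply by 10, and round to whole numbers
# (rounding removes the small floating-point error in e.g. 3.1 - 3.0).
coded = {key: tuple(round((x - ZERO) * SCALE) for x in row)
         for key, row in purity.items()}

for key, row in coded.items():
    print(key, row)

# Any variance calculated from the coded figures is SCALE**2 = 100 times too large.
```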

It  should  be  remembered  that  the  data  is  completely  symmetrical  with  respect 
to  the  three  variables,  A,  W,  and  T.  This  might  be  clearer  if  we  were  able, 
using  three-dimensional  paper,  to  set  the  lower  half  of  the  table  (for  A2)  on  top 
of the upper half of the table (for A1).  Then the horizontal axis across the page
represents  difference  between  times  of  boiling  (T),  the  vertical  axis  down  the  page 
represents  type  of  wash  (W),  and  the  axis  perpendicular  to  the  page  represents 
type  of  solvent  (A).  The  table  as  it  stands  has  the  form  of  tables  for  W  and  T  for 
each  value  of  A :  that  is  an  arbitrary  choice  and  it  might  equally  well  have  been 
W  and  A  tables  for  each  value  of  T,  or  A  and  T  tables  for  each  value  of  W. 

We  first  form  the  three  two-way  tables  by  summing  over  each  of  the  three 
variables  in  turn.  Thus  we  sum  over  W  for  each  value  of  A  and  T,  getting  the 
table  below : 


              T1        T2        T3        Totals

A1            -8        -24       -28       -60
A2            44        7         7         58

Totals        36        -17       -21       -2
—2 
Summing over A for each value of T and W we get the table below:

              T1        T2        T3        Totals

W1            14        -12       -6        -4
W2            22        -5        -15       2

Totals        36        -17       -21       -2

The  third  table  is  obtained  by  summing  over  T  for  each  value  of  A  and  W, 
below : 


              A1        A2        Totals

W1            -26       22        -4
W2            -34       36        2

Totals        -60       58        -2
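The three two-way tables above can be formed mechanically by summing the coded figures over each factor in turn. The sketch below is illustrative only; the key layout (solvent, wash, time) and the function name `two_way` are assumptions.

```python
from collections import defaultdict

# Coded figures keyed by (solvent, wash); each tuple runs over T1, T2, T3.
coded = {
    ("A1", "W1"): (1, -10, -17),
    ("A1", "W2"): (-9, -14, -11),
    ("A2", "W1"): (13, -2, 11),
    ("A2", "W2"): (31, 9, -4),
}

# One entry per original individual, keyed by (A, W, T).
cells = {(a, w, f"T{i + 1}"): x
         for (a, w), row in coded.items()
         for i, x in enumerate(row)}


def two_way(keep):
    """Sum over the factor not listed in `keep` (positions in the (A, W, T) key)."""
    table = defaultdict(int)
    for key, x in cells.items():
        table[tuple(key[i] for i in keep)] += x
    return dict(table)


at = two_way((0, 2))   # summed over W
wt = two_way((1, 2))   # summed over A
aw = two_way((0, 1))   # summed over T
print(at)
print(wt)
print(aw)
```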

We  analyse  the  first  of  these  two-way  tables,  for  A  and  T,  as  follows  : 

Form  the  following  terms — 

(1)  Square  the  individuals,  sum  the  squares,  and  divide  by  the  number  of 
original  individuals  that  have  been  summed  to  give  the  individuals  in  this  two-way 
table.     Here  each  individual  is  the  sum  of  two  original  individuals.     Thus  we  have 

$$[(-8)^2 + (-24)^2 + (-28)^2 + 44^2 + 7^2 + 7^2]/2 = 1729$$

(2)  Obtain  the  total  for  each  row,  square  these  totals,  sum  these  squares, 
and  divide  this  total  by  the  number  of  original  individuals  that  have  been  summed 
to  give  the  row  totals.  Here  each  row  total  is  the  sum  of  6  original  individuals 
(the  row  totals  are  each  the  sum  of  3  two-way  table  individuals,  each  of  which  is 
the  sum  of  2  original  individuals).     Thus  we  have 

$$[(-60)^2 + 58^2]/6 = 1160.67$$

(3)  Perform  the  identical  operation  for  the  column  totals.  Here  each  column 
total  is  the  sum  of  4  original  individuals,  so  the  divisor  is  4.     Thus  we  have 

$$[36^2 + (-17)^2 + (-21)^2]/4 = 506.5$$

(4)  Obtain  the  grand  total  for  all  individuals,  square  this  grand  total,  and 
divide  by  the  grand  total  number  of  original  individuals 

$$(-2)^2/12 = 0.33$$

It  will  be  noted  that  these  operations  are  similar  to  the  analysis  of  a  row  and 
column  table,  discussed  earlier,  differing  only  in  that  each  sum  of  squares  is 
divided,  not  by  the  number  of  individuals  in  the  two-way  table  in  each  total 
which  is  being  squared,  but  by  the  number  of  original  individuals  in  each  total 
that  is  being  squared. 
We can now form a table for the analysis of variance of this row and column table:

Source of Variance      Degrees of Freedom        Sums of Squares            Mean Squares

Between A               a - 1 = 2 - 1 = 1         (2) - (4) = 1160.33        1160.33
Between T               t - 1 = 3 - 1 = 2         (3) - (4) = 506.17         253.08
A x T interaction       (a - 1)(t - 1) = 2        62.17                      31.08

Total                   at - 1 = 5                (1) - (4) = 1728.67

a = number of levels of A
t = number of levels of T


This table corresponds closely to that previously discussed for row and column
analysis, except that what was formerly "Residual" is now termed "A X T
interaction." The reason for this will be discussed later. The Sum of Squares
corresponding to this term is best obtained by subtracting the sum of the A and T
terms from the Total, i.e.

1728.67 - (1160.33 + 506.17) = 62.17
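The four terms and the table above can be reproduced with a short sketch. The arrangement of the six cell totals into rows (levels of A) and columns (levels of T) is an assumption inferred from the row and column totals quoted in the text; the variable names are illustrative only.

```python
# A x T two-way table; each cell is the sum of 2 original individuals.
# Assumed layout: rows are A1, A2; columns are T1, T2, T3.
table = [[-8, -24, -28],
         [44, 7, 7]]
per_cell = 2  # original individuals summed into each cell

n_rows, n_cols = len(table), len(table[0])
row_totals = [sum(row) for row in table]
col_totals = [sum(col) for col in zip(*table)]
grand_total = sum(row_totals)
n_original = per_cell * n_rows * n_cols

# Terms (1) to (4): every divisor is a count of ORIGINAL individuals.
term1 = sum(x * x for row in table for x in row) / per_cell
term2 = sum(r * r for r in row_totals) / (per_cell * n_cols)
term3 = sum(c * c for c in col_totals) / (per_cell * n_rows)
term4 = grand_total ** 2 / n_original

ss_a = term2 - term4              # Between A
ss_t = term3 - term4              # Between T
ss_total = term1 - term4          # Total for this two-way table
ss_at = ss_total - ss_a - ss_t    # A x T interaction, by difference

for name, ss in [("A", ss_a), ("T", ss_t), ("A x T", ss_at), ("Total", ss_total)]:
    print(f"{name:6s} {ss:9.2f}")
```

The only departure from an ordinary row and column analysis is the divisors, which count original individuals rather than table entries.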

Precisely  similar  operations  are  performed  on  the  other  two-way  tables. 

Finally, the Total Sum of Squares is obtained by squaring all the original
individuals, summing these squares, and subtracting from this sum the square of
the grand total divided by the total number of original individuals, i.e.

1^2 + (-10)^2 + (-17)^2 + (-9)^2 + ... + (-4)^2 - (-2)^2/12 = 2139.67

All these results are now gathered together in Table 11.6.

Table 11.6

Source of Variance    Degrees of Freedom             Sums of Squares   Mean Squares   Components of Variance
Between A             (a - 1) = 1                    1160.33           1160.33        $wt\sigma_a^2 + t\sigma_{aw}^2 + w\sigma_{ta}^2 + \sigma_0^2$
Between W             (w - 1) = 1                       3.00              3.00        $at\sigma_w^2 + t\sigma_{aw}^2 + a\sigma_{wt}^2 + \sigma_0^2$
Between T             (t - 1) = 2                     506.17            253.08        $wa\sigma_t^2 + w\sigma_{ta}^2 + a\sigma_{wt}^2 + \sigma_0^2$
A X W Interaction     (a - 1)(w - 1) = 1               40.33             40.33        $t\sigma_{aw}^2 + \sigma_0^2$
W X T Interaction     (w - 1)(t - 1) = 2               45.50             22.75        $a\sigma_{wt}^2 + \sigma_0^2$
T X A Interaction     (t - 1)(a - 1) = 2               62.17             31.08        $w\sigma_{ta}^2 + \sigma_0^2$
Residual              (a - 1)(t - 1)(w - 1) = 2       322.17            161.08        $\sigma_0^2$
Total                 awt - 1 = 11                   2139.67

a, w, and t are respectively equal to the number of levels of A, W, and T.

$\sigma_a^2$, $\sigma_w^2$, and $\sigma_t^2$ are respectively equal to the variances due to general differences
between levels of A, levels of W, and levels of T.

$\sigma_{aw}^2$, $\sigma_{wt}^2$, and $\sigma_{ta}^2$ are respectively equal to the variances due to interactions between
A and W, between W and T, and between T and A.

$\sigma_0^2$ is the residual variance, unattributable to any of the above terms.

The  sum  of  squares  corresponding  to  the  residual  is  obtained  as  the  difference 
between  the  total  sum  of  squares  and  the  sum  of  the  six  other  terms. 


The  meaning  of  these  terms  is  as  follows  : 

$\sigma_a^2$ is the variability that would occur if the other two factors W and T were
held constant; similarly $\sigma_w^2$ is the variability that would occur if A and T were held
constant, and $\sigma_t^2$ the variability that would occur if A and W were held constant.

The A X W Interaction $\sigma_{aw}^2$ is a measure of the extent to which the A effect
depends on the value of W, and conversely, to which the W effect depends on the
value of A. Similarly for the other interactions.

The Residual $\sigma_0^2$ is that portion of the variability that cannot be allocated
to the other six terms. It can be regarded as a measure of error. In the present
experimental design, the possible second order interaction A X W X T, if it
exists, will contribute to the same term. The second order interaction A X W X T
measures the extent to which the A X W interaction depends upon the level of T,
or equivalently the extent to which the A X T interaction depends on the level of W,
or the T X W interaction on the level of A. If this effect does exist, it will give us
an inflated estimate of error, and hence the other effects will be less likely to
attain significance. This situation is discussed later in Chapter XII (e).
For the present we will assume that the A X W X T interaction does not exist,
or at any rate is not large.

We now need to apply a test of significance for the existence of these terms.
We can see that if, for example, $\sigma_{ta}^2$ is to exist, then $(w\sigma_{ta}^2 + \sigma_0^2)$ must be
significantly greater than $\sigma_0^2$, i.e. the T X A interaction mean square must be
significantly greater than the residual mean square.

Here we see that the Mean Squares for the W, A X W, W X T, and T X A terms
are all less than the Residual Mean Square, so that there is no evidence that
$\sigma_w^2$, $\sigma_{aw}^2$, $\sigma_{wt}^2$, and $\sigma_{ta}^2$ exist, and we take them to be zero. We can
therefore use these mean squares as estimates of $\sigma_0^2$ to get a more accurate
estimate. We pool their sums of squares and their degrees of freedom, i.e.

(3.00 + 40.33 + 45.50 + 62.17 + 322.17)/(1 + 1 + 2 + 2 + 2) = 59.1

This new residual has 8 degrees of freedom.

Considering the T term, the T Mean Square now estimates $wa\sigma_t^2 + \sigma_0^2$,
for $\sigma_{ta}^2$ and $\sigma_{wt}^2$ have been taken to be zero. To establish the existence of $\sigma_t^2$ we
therefore test the T mean square against the (new) residual. The variance ratio
is 253.08/59.1 = 4.3 for degrees of freedom n1 = 2, n2 = 8. This lies close to
the 5% level of significance, so we may assume that T does have an effect.

Similarly the A term gives a variance ratio of 1160.33/59.1 = 19.6 for degrees
of freedom n1 = 1, n2 = 8, this being considerably more significant than the
1% level, so we assume that A does have an effect.
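The pooling step and the two variance ratios can be checked with a minimal sketch. The sums of squares and degrees of freedom are those of Table 11.6; the dictionary layout and names are illustrative assumptions. Significance levels still have to be read from the variance ratio tables.

```python
# (sum of squares, degrees of freedom) for the terms pooled into the new residual
pooled_terms = {
    "W": (3.00, 1),
    "A x W": (40.33, 1),
    "W x T": (45.50, 2),
    "T x A": (62.17, 2),
    "Residual": (322.17, 2),
}

pooled_ss = sum(ss for ss, _ in pooled_terms.values())
pooled_df = sum(df for _, df in pooled_terms.values())
pooled_ms = pooled_ss / pooled_df   # new residual mean square

# Variance ratios of the remaining main effects against the pooled residual
ratio_t = 253.08 / pooled_ms        # T mean square, n1 = 2, n2 = pooled_df
ratio_a = 1160.33 / pooled_ms       # A mean square, n1 = 1, n2 = pooled_df

print(f"pooled residual = {pooled_ms:.2f} on {pooled_df} degrees of freedom")
print(f"variance ratio for T = {ratio_t:.1f}, for A = {ratio_a:.1f}")
```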

We can now take the means for each level of A and T, and conclude that
for A1 we have, translating back into terms of the original variable, a mean value
of 2.00 and for A2 a mean value of 3.97. For T1 the mean is 3.90, for T2 2.58, and
for T3 2.48. It is clear that the difference between T2 and T3 is so slight as to
be not significant, but the difference between these two and T1 is quite
considerable. We can also conclude that the difference between the two levels of W is
quite insignificant.

Each of the averages for the three levels of T is the average of 4 results. In
the units in which the analysis of variance was carried out the residual variance
was 59.15, corresponding to a standard deviation of 7.691, or converting to original
units 0.7691. Accordingly the standard deviation of the average of four results
will be 0.7691/√4 = 0.3845. This then is the standard deviation of each of the
three averages quoted above for the three levels of T. The standard deviation
of the difference between any two is 0.3845 √2 = 0.5438.

Proceeding similarly for the two averages for the two levels of A, these each
have a standard deviation of 0.7691/√6 = 0.3140, and the standard error of the
difference between these two averages, which is 3.97 - 2.00 = 1.97, is 0.3140 √2
= 0.4440. The residual on which this standard error is based has 8 degrees of
freedom, for which for the 5% level of significance t = 2.31 (see Table I,
Appendix). Accordingly the 95% confidence limits for the difference between
A1 and A2 are 1.97 ± 2.31 X 0.444, or 3.00 and 0.94.
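The standard errors and confidence limits for the A comparison can be reproduced as below. This is a sketch under the assumptions stated in the text: the analysis was carried out in units ten times the original, and t = 2.31 is the tabulated 5% value for 8 degrees of freedom. The function names are illustrative.

```python
import math

residual_variance = 59.15                          # pooled residual, coded units
sd_original = math.sqrt(residual_variance) / 10    # coded units are 10 x original


def sd_of_mean(sd, n):
    """Standard deviation of the average of n results."""
    return sd / math.sqrt(n)


def sd_of_difference(sd_mean):
    """Standard deviation of the difference between two independent averages."""
    return sd_mean * math.sqrt(2)


sd_t_mean = sd_of_mean(sd_original, 4)   # each T average is the mean of 4 results
sd_a_mean = sd_of_mean(sd_original, 6)   # each A average is the mean of 6 results
se_a_diff = sd_of_difference(sd_a_mean)

t_value = 2.31                           # 5% level, 8 degrees of freedom (from the text)
difference = 3.97 - 2.00                 # A2 mean minus A1 mean
lower = difference - t_value * se_a_diff
upper = difference + t_value * se_a_diff

print(f"95% limits for A2 - A1: {lower:.2f} to {upper:.2f}")
```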

In  this  example,  having  demonstrated  the  significance  of  certain  effects,  we 
have  proceeded  to  calculate  the  means  for  each  level  of  these  effects  in  order  to 
demonstrate  their  mode  of  action.  In  this  example  each  factor  and  its  levels 
had precise and obvious meanings, and when we compared A1 with A2 we knew
what  comparison  we  were  making  (we  were  comparing  one  solvent  with  another). 
In  some  types  of  analysis,  however,  the  difference  between  different  levels  of  a 
particular  factor  may  not  be  obvious.  Thus  a  factor  may  be  shift,  with  3  levels. 
Or  it  may  be  day,  with  any  number  of  levels.  Or  it  may  be  machine  for  a  group 
of  allegedly  similar  machines.  With  this  type  of  factor,  where  the  distinction  between 
the  levels  is  rather  vague  and  elusive,  it  is  often  better  to  calculate  the  actual 
magnitude  of  the  variance  due  to  each  term. 

(e)  The  Four  Factor  Analysis 

(i) In the previous section, we had g grades of polymer by each of s suppliers
being processed at each of t temperatures. If we suppose that the suppliers were
supplied with raw material from each of r sources, then R becomes a fourth main
factor. There are thus gtsr individuals, and each can be uniquely determined
by specifying particular values of the categories. It is important that every
supplier draws from each source of raw material.

The  components  of  variance  are  reproduced  in  Table  11.8  below,  along  with 
the  respective  degrees  of  freedom. 

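Table 11.8

Source of Variance   Degrees of Freedom              Components of Variance
G                    g - 1                           $\sigma_0^2 + r\sigma_{gts}^2 + t\sigma_{srg}^2 + s\sigma_{rgt}^2 + sr\sigma_{gt}^2 + ts\sigma_{rg}^2 + rt\sigma_{gs}^2 + tsr\sigma_g^2$
T                    t - 1                           $\sigma_0^2 + r\sigma_{gts}^2 + g\sigma_{tsr}^2 + s\sigma_{rgt}^2 + sr\sigma_{gt}^2 + rg\sigma_{ts}^2 + gs\sigma_{tr}^2 + gsr\sigma_t^2$
S                    s - 1                           $\sigma_0^2 + r\sigma_{gts}^2 + g\sigma_{tsr}^2 + t\sigma_{srg}^2 + rg\sigma_{ts}^2 + gt\sigma_{sr}^2 + rt\sigma_{gs}^2 + gtr\sigma_s^2$
R                    r - 1                           $\sigma_0^2 + g\sigma_{tsr}^2 + s\sigma_{rgt}^2 + t\sigma_{srg}^2 + gt\sigma_{sr}^2 + ts\sigma_{rg}^2 + gs\sigma_{tr}^2 + gts\sigma_r^2$
G X T                (g - 1)(t - 1)                  $\sigma_0^2 + s\sigma_{rgt}^2 + r\sigma_{gts}^2 + sr\sigma_{gt}^2$
T X S                (t - 1)(s - 1)                  $\sigma_0^2 + r\sigma_{gts}^2 + g\sigma_{tsr}^2 + rg\sigma_{ts}^2$
S X R                (s - 1)(r - 1)                  $\sigma_0^2 + g\sigma_{tsr}^2 + t\sigma_{srg}^2 + gt\sigma_{sr}^2$
R X G                (r - 1)(g - 1)                  $\sigma_0^2 + t\sigma_{srg}^2 + s\sigma_{rgt}^2 + ts\sigma_{rg}^2$
G X S                (g - 1)(s - 1)                  $\sigma_0^2 + r\sigma_{gts}^2 + t\sigma_{srg}^2 + rt\sigma_{gs}^2$
T X R                (t - 1)(r - 1)                  $\sigma_0^2 + g\sigma_{tsr}^2 + s\sigma_{rgt}^2 + gs\sigma_{tr}^2$
G X T X S            (g - 1)(t - 1)(s - 1)           $\sigma_0^2 + r\sigma_{gts}^2$
T X S X R            (t - 1)(s - 1)(r - 1)           $\sigma_0^2 + g\sigma_{tsr}^2$
S X R X G            (s - 1)(r - 1)(g - 1)           $\sigma_0^2 + t\sigma_{srg}^2$
R X G X T            (r - 1)(g - 1)(t - 1)           $\sigma_0^2 + s\sigma_{rgt}^2$
Residual             (g - 1)(t - 1)(s - 1)(r - 1)    $\sigma_0^2$
Total                gtsr - 1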

The  appropriate  tests  of  significance  can  be  worked  out  from  the  components 
of  variance. 
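The components of variance in Table 11.8 follow a regular pattern, which the sketch below generates. It rests on two assumptions read off the table rather than stated in the text: every interaction that contains the term contributes its variance multiplied by the numbers of levels of the factors it does not contain, and the highest order interaction is merged with the residual. The function name and the plain-text notation (`sigma_gts^2`) are illustrative only.

```python
from itertools import combinations


def components_of_variance(term, factors):
    """Return the components of variance for `term` as a plain-text sum.

    term    -- letters of the factors in the term, e.g. "gt" for G x T
    factors -- letters of all factors in the experiment, e.g. "gtsr"
    """
    others = [f for f in factors if f not in term]
    parts = ["sigma_0^2"]
    # Every effect containing `term`, highest order first. The effect that
    # contains all the factors is skipped: it is merged with the residual.
    for size in range(len(others) - 1, -1, -1):
        for extra in combinations(others, size):
            effect = "".join(f for f in factors if f in term or f in extra)
            multiplier = "".join(f for f in others if f not in extra)
            parts.append(f"{multiplier}*sigma_{effect}^2")
    return " + ".join(parts)


for term in ["g", "gt", "gts", "gtsr"]:
    print(f"{term:5s} {components_of_variance(term, 'gtsr')}")
```

Reading such a listing shows which mean square a given term should be tested against: the one whose components differ from it only by the variance in question.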
(ii) The data below are from a four factor experiment, each factor at two levels.
The experiment is on a purification process, and high figures for the dependent
variable correspond to an inferior product. The treatments were such that the
crude material could be given a hot or cold wash (H1 or H2), then given a period
of boiling or not given a period of boiling (B2 or B1), then dissolved, filtered, and
precipitated from either of two solvents (S1 or S2), and finally given a cold or hot
wash (W1 or W2).


S:        S1                                      S2
W:        W1                  W2                  W1                  W2
B:        B1        B2        B1        B2        B1        B2        B1        B2
H:        H1   H2   H1   H2   H1   H2   H1   H2   H1   H2   H1   H2   H1   H2   H1   H2

Observed  4.2  2.7  1.4  1.3  3.3  3.5  1.3  1.9  2.2  3.1  4.0  4.1  5.0  3.0  2.2  2.6
Coded     17   2    -11  -12  8    10   -12  -6   -3   6    15   16   25   5    -3   1

In carrying out the analysis of variance, for convenience 2.5 was subtracted
from each figure, and the results multiplied by 10 to remove the decimal point,
this giving the figures in the last row. The effect of this transformation is that
all the sums of squares in the analysis of variance are 10^2 = 100 times too large.
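The coding can be checked in a few lines. This is a minimal sketch using the sixteen observations in the order in which they appear in the data table; the helper name `ss_about_mean` is illustrative.

```python
# Observed impurity figures, in the column order of the data table
observed = [4.2, 2.7, 1.4, 1.3, 3.3, 3.5, 1.3, 1.9,
            2.2, 3.1, 4.0, 4.1, 5.0, 3.0, 2.2, 2.6]

# Subtract 2.5 and multiply by 10; round() removes floating-point noise
coded = [round((x - 2.5) * 10) for x in observed]


def ss_about_mean(values):
    """Sum of squared deviations from the mean."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)


ss_observed = ss_about_mean(observed)
ss_coded = ss_about_mean(coded)

print(coded)
print(f"ratio of sums of squares = {ss_coded / ss_observed:.1f}")
```

Subtracting the constant leaves every sum of squares unchanged; only the multiplication by 10 matters, and it enters as its square.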

We proceed¹ as follows:

(1)  We  form  the  four  tables  obtained  by  summing  over  each  of  the  four  variables 
in  turn.     Thus  the  table  obtained  by  summing  over  S  is  as  below : 


            W1                      W2
       B1         B2           B1         B2
     H1   H2    H1   H2      H1   H2    H1   H2

     14    8     4    4      33   15   -15   -5

The  table  obtained  by  summing  over  W  is  as  below  : 


            S1                      S2
       B1         B2           B1         B2
     H1   H2    H1   H2      H1   H2    H1   H2

     25   12   -23  -18      22   11    12   17

The  other  two  three-way  tables,  obtained  by  summing  over  B  and  over  H, 
are  similar. 

¹ For an alternative method of calculating the analysis of variance of a 2^n experiment
see page 138.
(2) We form the six tables obtained by summing over the variables two at a time.
Thus the tables obtained by summing over S and H, and over S and B, are as
below:

Summed over H and S

            B1      B2    Totals
W1          22       8      30
W2          48     -20      28
Totals      70     -12      58

Summed over S and B

            H1      H2    Totals
W1          18      12      30
W2          18      10      28
Totals      36      22      58

The other two-way tables are as below:

Summed over H and W

            B1      B2    Totals
S1          37     -41      -4
S2          33      29      62
Totals      70     -12      58

Summed over B and W

            H1      H2    Totals
S1           2      -6      -4
S2          34      28      62
Totals      36      22      58

Summed over B and H

            W1      W2    Totals
S1          -4       0      -4
S2          34      28      62
Totals      30      28      58

Summed over S and W

            H1      H2    Totals
B1          47      23      70
B2         -11      -1     -12
Totals      36      22      58

(3) We take the square of the grand total and divide it by the grand total number
of observations:

58^2/16 = 210.25

We shall use this quantity a great deal, and refer to it as "the correcting
factor," as it corrects all our sums of squares for the fact that they are deviations
from zero, not from the mean.

(4) To form the Sum of Squares for the main effect of the factor S, we square
the S totals, sum these squares, divide by the number of original individuals
forming each S total, and subtract from this the correcting factor, i.e.

[(-4)^2 + 62^2]/8 - 210.25 = 272.25

The other main effects are obtained similarly. Thus:

For W:  (30^2 + 28^2)/8 - 210.25 = 0.25
For B:  (70^2 + (-12)^2)/8 - 210.25 = 420.25
For H:  (36^2 + 22^2)/8 - 210.25 = 12.25

(5)  The  first  order  interactions  measure  the  extent  to  which  the  effect  of  one 
factor  depends  upon  the  value  of  the  other  factor.  Thus  there  are  three  first 
order  interactions  involving  S,  namely  S  X  W,  S  X  B,  and  S  X  H. 

The Sum of Squares corresponding to the first mentioned, S X W, is obtained
by squaring the individuals in the two-way table for S and W, summing these
squares, and dividing by the number of original individuals making up the
individuals in this table, and from this subtracting the correcting factor, the S Sum
of Squares, and the W Sum of Squares, i.e.

[(-4)^2 + 0^2 + 34^2 + 28^2]/4 - 210.25 - 272.25 - 0.25 = 6.25

Similarly, the B X H interaction sum of squares is given by

[47^2 + 23^2 + (-11)^2 + (-1)^2]/4 - 210.25 - 420.25 - 12.25 = 72.25

The other first order interactions are derived likewise.

(6) It will be apparent that there is a possibility that the effect of, say, the B X H
interaction may depend upon the value of W or of S. The former is termed the
W X B X H interaction and the latter the S X B X H interaction. Interactions
of this type are known as second order interactions.

The sums of squares for these terms are derived from the three-way tables.
Thus to derive the Sum of Squares for the S X B X H interaction we first square
the individuals in the S X B X H table, sum these squares, divide by the
number of original individuals making up each individual in the table, and
subtract the correcting factor, i.e.

[25^2 + 12^2 + (-23)^2 + (-18)^2 + 22^2 + 11^2 + 12^2 + 17^2]/2 - 210.25 = 1119.75

From this figure we subtract the sums of squares for all the main effects and all
the first order interactions which do not involve the odd variable. Thus with
the S X B X H sum of squares the odd variable is W, so we subtract from
1119.75 the sums of squares for the S, B, H, S X B, B X H, and H X S terms, i.e.

1119.75 - 272.25 - 420.25 - 12.25 - 342.25 - 72.25 - 0.25 = 0.25

The other second order interactions are obtained similarly.

(7) The Total Sum of Squares is the difference between the sum of the squares
of the original individuals and the correcting factor, i.e.

17^2 + 2^2 + (-11)^2 + ... + (-3)^2 + 1^2 - 210.25 = 1877.75

(8)  The  Sum  of  Squares  corresponding  to  the  Residual  is  obtained  by  subtracting 
the  four  main  effects,  the  six  first  order  interactions,  and  the  four  second  order 
interactions  from  the  Total  Sum  of  Squares. 
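Steps (1) to (8) can be sketched in code as follows. The sketch assumes the coded observations are listed in the order of the data table (S slowest, then W, then B, with H varying fastest); the function and variable names are illustrative. Each term's sum of squares is formed from the table of totals for its factors, less the correcting factor and less every lower order term contained in it.

```python
from itertools import combinations, product

factors = "SWBH"
coded = [17, 2, -11, -12, 8, 10, -12, -6, -3, 6, 15, 16, 25, 5, -3, 1]

# Key each observation by its (S, W, B, H) level indices, 0 or 1
data = dict(zip(product(range(2), repeat=len(factors)), coded))
n = len(data)
correction = sum(coded) ** 2 / n          # step (3): the correcting factor


def crude_ss(term):
    """Squared totals of the table for `term`, divided by the number of
    original individuals per total, less the correcting factor."""
    positions = [factors.index(f) for f in term]
    totals = {}
    for key, y in data.items():
        cell = tuple(key[i] for i in positions)
        totals[cell] = totals.get(cell, 0) + y
    per_total = n // len(totals)
    return sum(t * t for t in totals.values()) / per_total - correction


ss = {}
for order in range(1, len(factors)):      # main effects, first order, second order
    for term in combinations(factors, order):
        lower = sum(ss[sub] for k in range(1, order)
                    for sub in combinations(term, k))
        ss[term] = crude_ss(term) - lower

ss_total = sum(y * y for y in coded) - correction   # step (7)
ss_residual = ss_total - sum(ss.values())           # step (8)

for term, value in ss.items():
    print(f"{' x '.join(term):10s} {value:8.2f}")
print(f"{'Residual':10s} {ss_residual:8.2f}")
print(f"{'Total':10s} {ss_total:8.2f}")
```

The dictionary `ss` is filled in order of increasing complexity, so the lower order terms needed for each subtraction are always available.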

We  thus  have  a  complete  analysis  as  in  Table  11.9  below.  The  degrees  of 
freedom  are  of  the  following  form  (small  letters  represent  the  number  of  levels 
at  which  the  factor  denoted  by  the  capital  letter  occurs). 

Table 11.9

Source of Variance    Sums of Squares
B                      420.25
H                       12.25
S                      272.25
W                        0.25
B X H                   72.25
H X S                    0.25
S X W                    6.25
W X B                  182.25
W X H                    0.25
B X S                  342.25
B X H X S                0.25
H X S X W              156.25
S X W X B              272.25
W X B X H               30.25
Residual               110.25
Total                 1877.75
(a) for the main effects, e.g. for B, (b - 1)

(b) for the first order interactions, e.g. for H X B, (h - 1)(b - 1)

(c) for the second order interactions, e.g. for H X B X W, (h - 1)(b - 1)(w - 1)

(d) for the residual, (h - 1)(b - 1)(w - 1)(s - 1)

(e) for the total, (hbws - 1).

In the present case, the degrees of freedom are in every case 1 except for
the total.² Accordingly, the column for Degrees of Freedom is omitted. Also,
since the Sums of Squares thus equal their respective Mean Squares the latter
column is omitted.
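The rules (a) to (e) can be written as one small function. This is a sketch; the function name and the dictionary of levels are illustrative assumptions. It also checks that the degrees of freedom of all the terms add up to the total.

```python
from itertools import combinations
from math import prod


def degrees_of_freedom(levels):
    """Degrees of freedom for every term of a full factorial experiment with
    one observation per cell. `levels` maps factor name to number of levels."""
    names = list(levels)
    df = {}
    for order in range(1, len(names) + 1):
        for term in combinations(names, order):
            # The highest order term serves as the residual
            label = " x ".join(term) if order < len(names) else "Residual"
            df[label] = prod(levels[f] - 1 for f in term)
    df["Total"] = prod(levels.values()) - 1
    return df


two_level = degrees_of_freedom({"H": 2, "B": 2, "W": 2, "S": 2})
general = degrees_of_freedom({"A": 2, "B": 3, "C": 4, "D": 5})

print(two_level)
print(general["B x C"], general["Residual"], general["Total"])
```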

Analysing the various terms it is clear that the W X B X H and B X H X S
interactions do not exist, so we can pool them with the residual giving a new
residual of 140.75/3 = 46.92 with 3 degrees of freedom. The H X S X W
interaction is not significant, and pooling it we get a new residual of
297.00/4 = 74.25 with 4 degrees of freedom. Tested against this the S X W X B
interaction does not reach the 5% level of significance, but it is suspiciously large,
and it is desirable to split the experiment up into two three factor experiments to
check that the factors are not interacting.
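The successive pooling described above amounts to the following arithmetic. This is a sketch with the sums of squares taken from Table 11.9; each term has 1 degree of freedom, and the names are illustrative.

```python
residual_ss, residual_df = 110.25, 1

# First pooling: the two second order interactions that clearly do not exist
for ss in (30.25, 0.25):            # W x B x H and B x H x S
    residual_ss += ss
    residual_df += 1
first_pooled_ms = residual_ss / residual_df

# Second pooling: H x S x W, not significant against the first pooled residual
residual_ss += 156.25
residual_df += 1
second_pooled_ms = residual_ss / residual_df

# Variance ratio for S x W x B against the final residual (n1 = 1, n2 = 4)
ratio_swb = 272.25 / second_pooled_ms

print(f"{first_pooled_ms:.2f} on 3 d.f., {second_pooled_ms:.2f} on {residual_df} d.f.")
print(f"variance ratio for S x W x B = {ratio_swb:.2f}")
```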

With one second order interaction significant, say the S X W X B interaction,
there is a choice of three methods of splitting, viz. H X W X B for each level
of S, H X S X W for each level of B, or H X S X B for each level of W.

Which method of breakdown we employ is largely a matter of choice;
sometimes it may be informative to employ more than one. Here one particular
method of breakdown is strongly indicated, namely according to the solvent used
(factor S).

Accordingly, we carry out two three factor analyses, one for S1 and the other
for S2. The results of such analyses are below in Table 11.10.

Table 11.10

Source of Variance    Mean Squares
                      Experiment with S at S1    Experiment with S at S2
B                     760.5                        2.0
H                       8.0                        4.5
W                       2.0                        4.5
B X H                  40.5                       32.0
H X W                  72.0                       84.5
W X B                   4.5                      450.0
Residual               12.5                      128.0
Total                 900.0                      705.5

² In this respect the present example constitutes a special case. The adaptation of
these methods of computation to the more general case is quite straightforward. The
formulae for the degrees of freedom have just been given. With regard to the sums of
squares, if we have a four factor experiment with factors A, B, C, and D at 2, 3, 4, and 5
levels respectively, there are two A totals each the total of 3 X 4 X 5 = 60 observations,
so the sum of the squares of the A totals is divided by 60 before having the correcting factor
subtracted from it to give the A sum of squares. There are five D totals, each the total
of 2 X 3 X 4 = 24 observations, so the divisor for the sum of the squares of the D totals is 24.
For the interactions, for example, the B X C table has 3 X 4 = 12 entries, each the total of
2 X 5 = 10 observations; accordingly the sum of the squares of these 12 entries is divided
by 10 before having subtracted from it the correcting factor and the B and C Sums of Squares
to give the B X C Sum of Squares.
The Degrees of Freedom for each of the terms except the Total are 1 in every
case, so the Mean Squares equal their respective Sums of Squares, and therefore
columns for Degrees of Freedom and Sums of Squares are omitted from the above
table.
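The two three factor analyses can be reproduced by applying the same procedure to each half of the data. The sketch below assumes the coded observations are in the order of the original data table, so that the first eight belong to S1 and the last eight to S2, each ordered by W, then B, then H. The function name is illustrative, and the function handles two-level factors only.

```python
from itertools import combinations, product


def factorial_ss(values, factors):
    """Sums of squares for a two-level factorial with one observation per cell.
    `values` must be listed with the last factor in `factors` varying fastest."""
    data = dict(zip(product(range(2), repeat=len(factors)), values))
    n = len(values)
    correction = sum(values) ** 2 / n
    ss = {}
    for order in range(1, len(factors)):
        for term in combinations(range(len(factors)), order):
            totals = {}
            for key, y in data.items():
                cell = tuple(key[i] for i in term)
                totals[cell] = totals.get(cell, 0) + y
            crude = sum(t * t for t in totals.values()) / (n // len(totals)) - correction
            lower = sum(ss[sub] for k in range(1, order)
                        for sub in combinations(term, k))
            ss[term] = crude - lower
    total = sum(y * y for y in values) - correction
    named = {" x ".join(factors[i] for i in term): v for term, v in ss.items()}
    named["Residual"] = total - sum(ss.values())
    named["Total"] = total
    return named


coded = [17, 2, -11, -12, 8, 10, -12, -6, -3, 6, 15, 16, 25, 5, -3, 1]
s1 = factorial_ss(coded[:8], "WBH")   # experiment with S at S1
s2 = factorial_ss(coded[8:], "WBH")   # experiment with S at S2

for name in s1:
    print(f"{name:9s} {s1[name]:7.1f} {s2[name]:7.1f}")
```

Here the residual of each three factor analysis is the W X B X H term for that solvent, since there is only one observation per cell.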

Let us consider the S1 experiment first. The W X B interaction is not
significant, and neither is the B X H interaction. The H X W interaction is also
found to be non-significant. Clearly the H and W main effects are non-significant.
Pooling all these non-significant terms we get a residual of 23.2 with 6 degrees
of freedom. The only term left is the B main effect, giving a variance ratio of
760.5/23.2 = 32.8 with degrees of freedom n1 = 1, n2 = 6. This lies near the
0.1% level of significance.

We can thus conclude that the B factor has an effect upon the quality of the
product, B1 giving a mean of 3.42 and B2 a mean of 1.47. The significance of
the other factors W and H has not been established, the means for H1 and H2
being 2.55 and 2.35 respectively and for W1 and W2 being 2.4 and 2.5 respectively.

It  thus  seems  a  matter  of  indifference  which  level  of  H  and  of  W  we  use, 
but  it  is  very  important  to  select  the  right  level  of  B. 

Now let us consider the S2 experiment. We have a large residual. The
B X H and H X W interactions cannot be significant, and pooled give a new
residual of 81.5 with 3 degrees of freedom. The H main effect does not contain
amongst its components of variance $\sigma_{wb}^2$, which is contained amongst the
components of variance of the outstanding interaction W X B: the H term is
non-significant and can therefore be pooled to give a new residual of 62.25 with 4
degrees of freedom. Tested against this the W X B interaction gives a variance
ratio of 450.00/62.25 = 7.2 with degrees of freedom n1 = 1, n2 = 4, this being
close to the 5% level of significance.

The fact that the W X B interaction is marked for S2 but non-existent for
S1 implies the existence of a W X B X S interaction, which we had suspected
from the four factor analysis. The non-existence of a B main effect for S2 as
compared with its great significance for S1 indicates the existence of an S X B
interaction: the four factor analysis had shown that this exists but actually in
the more complicated guise of an S X B X W interaction.

Here we have a three factor analysis with a significant first order interaction,
W X B. Accordingly we need to break the three factor experiment down, either
into W X H experiments for B1 and for B2 or into B X H experiments for W1
and for W2.

The  results  of  the  former  are  below  in  Table  11.11.  Here  again  the  Degrees  of 
Freedom  for  each  term  except  the  total  are  1,  so  that  column  and  the  Mean  Square 
column  are  omitted. 

Table 11.11

Source of Variance    Sums of Squares
                      B at B1     B at B2
H                      30.25        6.25
W                     182.25      272.25
Residual              210.25        2.25
Total                 422.75      280.75
Considering the B2 experiment, the H term is not significant, and can be
pooled to give a new residual of 8.50/2 = 4.25 with two degrees of freedom.
The W term then gives a variance ratio of 272.25/4.25 = 64.0 for n1 = 1, n2 = 2,
this being more significant than 5%.

We can thus conclude that using the solvent S2 and treatment B2 (boiling)
it is a matter of indifference which level of H we use, but that it is important
to select the correct level of W, the averages for W1 and W2 being 4.05 and 2.40
respectively.

Turning now to the B1 experiment, we have a large residual, which suggests
that the H X W interaction exists. From the fact that it clearly does not exist for
B = B2, this suggests that it takes the guise of a B X H X W interaction. This was
not shown up on the full four factor analysis, but the evidence here is suggestive.
With such a large residual, the H and W terms will not be significant, but we
may get an idea of the behaviour of the system by forming an H X W table,
Table 11.12.


Table 11.12
S = S2, B = B1

          W1      W2
H1        2.2     5.0
H2        3.1     3.0

This table suggests that with S2 B1 we should use W1, and possibly that
H1 is better. However, these suggestions cannot be made as definite assertions.

That  is  as  much  as  can  be  done  with  that  method  of  breakdown.  Turning 
to  the  other,  the  results  of  the  analysis  are  below  in  Table  11.13.  The  degrees  of 
freedom  are  1  for  each  term  (except  the  total)  so  that  column  and  the  Mean  Square 
column  are  omitted. 

Table 11.13

Source of Variance    Sums of Squares
                      W at W1     W at W2
H                       25          64
B                      196         256
Residual                16         144
Total                  237         464

Considering the experiment for W1, clearly the H term is not significant.
Pooling it we get a new residual of 41/2 = 20.5 with 2 degrees of freedom. The
B term then has a variance ratio of 196/20.5 = 9.55 for n1 = 1, n2 = 2. This
is considerably more significant than the 20% level but less significant than the
5%. However, we may regard it provisionally as real.

We can thus conclude that using solvent S2 and treatment W1 it is a matter
of indifference which level of H we use but it is important to use the correct level
of B, the means for B1 and B2 being 2.65 and 4.05 respectively.
Considering the experiment for W2, we have a large residual, and therefore
we cannot expect significance for the H and B terms. However we may get an
idea of the behaviour of the system by inspecting a B X H table, as below.

S = S2, W = W2

          B1      B2
H1        5.0     2.2
H2        3.0     2.6

This table suggests that with S2 W2 we should use B2, and possibly H1 is
better.

Let us now see whether the two methods of breakdown give consistent
conclusions. The B1 and B2 method gave that with B2 we should use W2 to get
a low figure for the impurity, but that with B1 we should use W1. The W1 and
W2 method gave that with W1 we should use B1, and with W2 we should use
B2. As regards H, the B1 and B2 method did not give significant results, but the
W1 and W2 method suggested (qualitatively, not with an exact test of significance)
that H1 should be used to give low figures for the impurity. The two methods
of breakdown thus lead to similar conclusions.

The  effect  of  the  W  X  B  interaction  for  S2  is  shown  in  Table  11.14  below. 


Table 11.14
S = S2 (H averaged)

          B1       B2
W1        2.65     4.05
W2        4.0      2.4

This table shows clearly that to obtain a low figure for the impurity, if we
use treatment B1 then we must use W1, and if we use B2 then we must use W2.
There is no "best" level for either B or for W: the "best" levels depend upon
the levels of the other factor. Also solvent S2 behaves differently in this respect
from solvent S1, for which treatment B2 was always best irrespective of the values
of H or W. The interaction shown above is the B X W interaction, and since
it operates for S2 but not for S1 we can say that the S X W X B interaction exists.

It is of interest to note the type of error into which we might have fallen if
we had carried out this experiment classically rather than factorially. Classically
we would have investigated each factor at one level. If we were unlucky we
would have investigated B at W2 and W at B1. From the first experiment we
would have decided that in order to get a low figure for the impurity we should
use B2 and from the second experiment that we should use W1. We thus would
have decided to use W1 B2, an entirely faulty conclusion.
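The faulty classical conclusion can be traced with the four means of Table 11.14. This is a minimal sketch; the dictionary layout and variable names are illustrative, and the "unlucky" choice of fixed levels is the one described in the text.

```python
# Mean impurity for solvent S2, averaged over H (Table 11.14)
impurity = {
    ("B1", "W1"): 2.65, ("B2", "W1"): 4.05,
    ("B1", "W2"): 4.0,  ("B2", "W2"): 2.4,
}

# Classical approach: vary B with W held at W2, then vary W with B held at B1
best_b = min(("B1", "B2"), key=lambda b: impurity[(b, "W2")])
best_w = min(("W1", "W2"), key=lambda w: impurity[("B1", w)])
classical_choice = (best_b, best_w)

# Factorial approach: all four combinations are available for comparison
factorial_choice = min(impurity, key=impurity.get)

print("classical:", classical_choice, impurity[classical_choice])
print("factorial:", factorial_choice, impurity[factorial_choice])
```

The classical route combines two choices that were each sound on their own, and lands on the worst of the four combinations.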

This  has  been  a  lengthy  analysis,  and  we  might  recapitulate  the  conclusions 
reached : — 

(1) Using the solvent S1

(a) it is a matter of indifference whether we use H1 or H2

(b) it is a matter of indifference whether we use W1 or W2

(c) it is desirable to use B2 giving a mean figure for the impurity of 1.47
as compared with 3.42 for B1.

(2) Using the solvent S2

(a) it is a matter of indifference whether we use H1 or H2.

(b) using B1 we should use W1, when we get a figure of 2.65 for the
impurity.

(c) using B2 we should use W2, when we get a figure of 2.4 for the impurity.

These conclusions have all been reached rigorously, with no guess work, and
give a complete picture of the behaviour of the system.

It will be noted that this experiment has been complicated in the sense that
second order interactions have been significant. It would have been simpler
if only the main effects, or first order interactions, had been significant. With
second order interactions existing, however, the operation of the system is
complicated, so it is inevitable that its analysis should be also.

It  will  also  be  noted  that  with  significant  interactions  the  factorial  experiment 
loses  its  superiority  in  the  sense  of  efficiency  over  the  classical,  and  in  effect  reduces 
to  a  series  of  classical  experiments.  The  point  is  that  if  a  factorial  design  is  not 
used,  then  these  interactions  cannot  be  detected. 

These  questions  are  discussed  further  in  Chapter  XII  (d),  but  meantime  it 
must  be  noted  that  Tables  11.11  and  11.13  must  be  considered  not  satisfactory 
owing  to  the  inadequate  number  of  degrees  of  freedom.  When  these  high  order 
interactions  are  significant  we  require  more  experiments  than  were  made  in  this 
instance. 

(f)  The  Five  Factor  Analysis 

(i)  In  the  previous  section,  we  had  four  factors,  source  of  raw  material  (R), 
supplier  of  polymer  (S),  grade  of  polymer  (G),  and  temperature  of  Processing  (T). 
Let  us  now  assume  there  is  a  fifth  factor,  namely,  pressure  of  processing  (P). 

The  procedure  for  five  factor  analysis  of  variance  can  be  readily  generalised 
from  the  two,  three,  and  four  factor  analyses  discussed  earlier : — 

(1)  Sum  over  each  of  the  variables  in  turn.    This  will  give  us  five  tables,  each 
containing  four  variables. 

(2)  Sum  over  the  variables  taken  two  at  a  time.    This  will  give  us  ten  tables, 
each  containing  three  variables. 

(3)  Sum  over  the  variables  taken  three  at  a  time.    This  will  give  us  ten  tables, 
each  containing  two  variables. 
(4)  Derive the marginal totals of the tables (3).  This is equivalent to
summing over the variables taken four at a time.

(5)  Sum over the variables taken five at a time, i.e. take the grand total.

(6)  Sum the squares of all the figures in the tables described in (1) to (5)
and divide each sum of squares by the number of original individuals
summed to form each figure in that table.  Thus, in the table of class (3)
formed by summing over G, T, and S, i.e. the table for R and P, the
divisor is gts.

(7)  The main effects are given by subtracting (5) from each of the sums of
squares in (4) (both having been divided by their appropriate divisors).

(8)  To get, say, the GTS interaction, we take the table for G, T, and S (i.e. the
table of class (2) summed over R and P), take its sum of squares divided
by the appropriate divisor (here rp), and subtract from this the correcting
factor (5) divided by its divisor (gtsrp) and also the sums of
squares already derived for G, T, S, S × G, G × T and T × S.

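
Step (8) can be written out as a single expression. The dot notation is an assumption introduced here for illustration and is not the text's own: $X_{gts\cdot\cdot}$ stands for the total over R and P at one combination of levels of G, T and S, and $X_{\cdot\cdot\cdot\cdot\cdot}$ for the grand total. In the divisors, g, t, s, r and p are the numbers of levels of the factors.

```latex
\[
SS_{G \times T \times S}
  = \frac{1}{rp}\sum_{g,t,s} X_{gts\cdot\cdot}^{2}
  - \frac{X_{\cdot\cdot\cdot\cdot\cdot}^{2}}{gtsrp}
  - SS_{G} - SS_{T} - SS_{S}
  - SS_{S \times G} - SS_{G \times T} - SS_{T \times S}
\]
```
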
Studying the components of variance for a four factor analysis in Table 11.8,
and comparing them with Table 11.2 for three factors and Table 11.1 for two factors,
we can readily see how for a five factor experiment the components of variance
would be built up:

(a)  The highest order interaction, used as a residual, $\sigma_0^2$, has no coefficient.

(b)  The interactions of the next highest order have $\sigma_0^2$ and the component
corresponding to the interaction, the latter having as its coefficient the
letters not included in the subscripts of the interaction.  Thus the
G × T × S × R components are

$p\sigma_{gtsr}^2 + \sigma_0^2$

(c)  The interactions of the next highest order have $\sigma_0^2$, the component
corresponding to the interaction, and all the intermediate interactions
whose suffices include the interaction suffices.  The general rule for
coefficients applies throughout: the coefficients are those letters which
are not included in the subscripts.

Thus the G × T × S components are:

$rp\sigma_{gts}^2 + r\sigma_{gtsp}^2 + p\sigma_{gtsr}^2 + \sigma_0^2$

(d)  The first order interactions are built up according to the same general
rules.  Thus the S × R components are:

$gtp\sigma_{sr}^2 + gt\sigma_{psr}^2 + tp\sigma_{gsr}^2 + pg\sigma_{srt}^2 + g\sigma_{psrt}^2 + t\sigma_{gsrp}^2 + p\sigma_{gsrt}^2 + \sigma_0^2$

(e)  The main effects are built up similarly.  Thus, the S main effect includes
$\sigma_s^2$, the first order interactions of S, namely $\sigma_{sg}^2$, $\sigma_{sr}^2$, $\sigma_{sp}^2$ and $\sigma_{st}^2$, the
second order interactions of S, namely $\sigma_{sgr}^2$, $\sigma_{sgp}^2$, $\sigma_{srp}^2$, $\sigma_{stp}^2$, $\sigma_{str}^2$ and
$\sigma_{stg}^2$, the third order interactions of S, namely $\sigma_{rgst}^2$, $\sigma_{strp}^2$, $\sigma_{srpg}^2$ and
$\sigma_{spgt}^2$, and the residual $\sigma_0^2$.  All these components have their coefficients
determined by the general rule mentioned earlier: thus $\sigma_s^2$ has as its
coefficient prgt.
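
The rule in (a) to (e) is mechanical enough to be sketched in Python. The function name `variance_components` and the string conventions are assumptions made for this illustration; the sketch supposes, as in the text, an unreplicated experiment in which the highest order interaction serves as the residual.

```python
from itertools import combinations


def variance_components(term, factors):
    """List (coefficient, subscript) pairs for one line of the table.

    term    -- letters of the effect or interaction, e.g. "sr"
    factors -- all factor letters of the experiment, e.g. "gtsrp"
    The residual is returned as ("", "0").
    """
    base = set(term)
    others = [f for f in factors if f not in base]
    components = []
    # Add 0, 1, 2, ... extra letters to the subscript. Stopping one short
    # of all the remaining letters leaves out the highest order
    # interaction, which is the residual itself.
    for extra_count in range(len(others)):
        for extra in combinations(others, extra_count):
            subscript = base | set(extra)
            # General rule: the coefficient is the letters NOT in the subscript.
            coefficient = "".join(f for f in factors if f not in subscript)
            label = "".join(f for f in factors if f in subscript)
            components.append((coefficient, label))
    components.append(("", "0"))
    return components


# The S x R line of a five factor experiment on G, T, S, R and P.
for coefficient, label in variance_components("sr", "gtsrp"):
    print(f"{coefficient or '1':>4} * sigma^2_{label}")
```

For the S × R interaction this lists eight components, and for the S main effect sixteen, in agreement with (d) and (e); the letters within a coefficient or subscript may appear in a different order from the text.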

(ii)  A five factor experiment was carried out on a process for purifying a
crystalline product, all factors being at two levels. The quantity of the substance
per cycle (T) was tested, one level actually corresponding to double the
concentration of the other. The time to dissolve (D) the crystalline substance in the solvent
was tested, one level being four times the time of the other level. Time was
relevant because the dissolving was done in hot solution and, as the formation of
impurities is auto-catalytic, a long period at a high temperature might have
deleterious effects. The third factor W represented two slightly different liquids
used in the next stage of the process, in which there were two speeds of stirring
(S) and two times (G) to precipitate the pure crystal by steam distillation.

The original data are given in Table 11.15. The decimal point has been
removed and large figures represent an inferior product.

Table  11.15 
Original  Data  on  Five  Factor  Crystal  Purification  Experiment. 


                     W1                            W2
              G1            G2              G1            G2
            D1    D2      D1    D2        D1    D2      D1    D2

T1   S1    138   108     113    95       103   137     106   116
     S2    116   119     106   114       122   122     128   148

T2   S1    155   170     131   172       146   152     135   144
     S2    133   172     138   181       150   156     124   146

In the analysis of variance, all factors being at two levels, the degrees of
freedom for all terms are unity, and hence the Mean Squares are equal to the
Sums of Squares. In Table 11.16, giving the Analysis of Variance below, the third
order interactions (those between four factors) have not been calculated but have
been pooled with the Residual, whose degrees of freedom are thus (1 + 5) = 6.
Strictly speaking, these interactions should be calculated, but in practice it is
difficult to attach any great importance to them, and it is the effects earlier in
the table which are of practical interest. In Table 11.16 the figures 31 and 6 in
brackets after the Total and the Residual are the numbers of degrees of freedom
for the Total and Residual respectively.

Table  11.16 
Analysis  of  Variance  for  Five  Factor  Experiment  on  Crystal  Purification. 


Source of Variance        Sums of Squares

S                               91.1
G                              325.1
W                               21.1
T                             8256.1
D                             1352.0

S × G                          264.5
G × W                           12.6
W × T                          924.5
T × D                          741.2
D × S                          171.1
G × T                           18.0
W × D                            1.2
T × S                          128.1
D × G                          120.1
W × S                          112.5

S × G × W                       21.1
G × W × T                      406.1
W × T × D                     1200.4
T × D × S                        0.4
D × S × G                       24.6
S × W × T                       91.1
S × G × T                       91.1
T × D × G                       40.5
G × W × D                       31.9
D × W × S                      288.0

Residual (6)                   661.6

Total (31)                   15396.0
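
The procedure of steps (1) to (8) can be sketched in a few lines of Python and checked against Table 11.16. This is a minimal sketch and not the method of calculation used in the text: the function name `sums_of_squares`, the dictionary layout and the label format are illustrative assumptions. The figures are those of Table 11.15, read with rows T1 S1, T1 S2, T2 S1, T2 S2 and columns in the order W, G, D. A complete design with one observation per cell is assumed.

```python
from itertools import combinations, product

FACTORS = ("T", "S", "W", "G", "D")
ROWS = [  # Table 11.15; within a row D varies fastest, then G, then W
    [138, 108, 113, 95, 103, 137, 106, 116],   # T1 S1
    [116, 119, 106, 114, 122, 122, 128, 148],  # T1 S2
    [155, 170, 131, 172, 146, 152, 135, 144],  # T2 S1
    [133, 172, 138, 181, 150, 156, 124, 146],  # T2 S2
]


def load_table():
    """Return {(t, s, w, g, d): observation} with levels coded 0 and 1."""
    data = {}
    for (t, s), row in zip(product(range(2), repeat=2), ROWS):
        for (w, g, d), y in zip(product(range(2), repeat=3), row):
            data[(t, s, w, g, d)] = y
    return data


def sums_of_squares(data, factors):
    """Sum of squares for every main effect and interaction."""
    n = len(data)
    correction = sum(data.values()) ** 2 / n   # step (5) with its divisor
    ss = {}
    for order in range(1, len(factors) + 1):
        for axes in combinations(range(len(factors)), order):
            # Table for these factors: sum over all the other factors.
            totals = {}
            for levels, y in data.items():
                key = tuple(levels[a] for a in axes)
                totals[key] = totals.get(key, 0) + y
            divisor = n / len(totals)          # individuals per table entry
            crude = sum(v * v for v in totals.values()) / divisor - correction
            # Subtract every lower order term contained in this one.
            lower = sum(ss[sub] for k in range(1, order)
                        for sub in combinations(axes, k))
            ss[axes] = crude - lower
    return {" x ".join(factors[a] for a in axes): v for axes, v in ss.items()}


table = sums_of_squares(load_table(), FACTORS)
total = sum(table.values())
for name in ("T", "D", "T x W", "T x W x D"):
    print(f"{name:<10} {table[name]:8.1f}")
print(f"{'Total':<10} {total:8.1f}")
```

Labels follow the order of `FACTORS`, so W × T appears as "T x W". Adding the entries whose labels contain four or five factors gives the pooled Residual with 6 degrees of freedom.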

It is apparent that all the second order interactions except W × T × D are
non-significant, but the latter is large. To test W × T × D, we must note that
its components of variance are included amongst the components of variance of
W × T, T × D, D × W, W, T, and D, and therefore the latter terms cannot
be less than W × T × D. Of these latter terms, only the T and D main effects
are greater than W × T × D. Accordingly we pool the others with W × T × D
and obtain a new estimate of the latter of 2888.4/5 = 577.7 with 5 degrees of
freedom. Testing this against the residual of 1656.4/15 = 110.4 with 15 degrees
of freedom (the Residual of Table 11.16 pooled with the nine non-significant
second order interactions), we get a variance ratio of 5.2 for n1 = 5, n2 = 15,
corresponding to a probability of 1%. Since T is the more prominent main effect,
it would seem most profitable to break down the five factor analysis into two
four factor analyses for each level of T, as in Table 11.17.
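
The pooling arithmetic can be retraced as follows. This is only a check on the figures quoted, using the Sums of Squares printed in Table 11.16; the variable names are illustrative assumptions.

```python
# Sums of Squares copied from Table 11.16; each carries one degree of freedom.
wtd = 1200.4
pooled_with_wtd = {"W x T": 924.5, "T x D": 741.2, "D x W": 1.2, "W": 21.1}

# The nine remaining second order interactions, pooled with the Residual.
other_second_order = [21.1, 406.1, 0.4, 24.6, 91.1, 91.1, 40.5, 31.9, 288.0]
residual_ss, residual_df = 661.6, 6

numerator_ss = wtd + sum(pooled_with_wtd.values())
numerator_df = 1 + len(pooled_with_wtd)
denominator_ss = residual_ss + sum(other_second_order)
denominator_df = residual_df + len(other_second_order)

numerator_ms = numerator_ss / numerator_df
denominator_ms = denominator_ss / denominator_df
variance_ratio = numerator_ms / denominator_ms

print(f"pooled W x T x D: {numerator_ss:.1f}/{numerator_df} = {numerator_ms:.1f}")
print(f"pooled residual:  {denominator_ss:.1f}/{denominator_df} = {denominator_ms:.1f}")
print(f"variance ratio:   {variance_ratio:.1f}")
```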

Examining the terms for T1, starting from the Residual and working towards
the top of the table, we find none of them significant.  T1 actually corresponds
to the use of the dilute solution. Under these conditions, therefore, we are at
liberty to choose the four factors at whatever levels may be convenient on
practical grounds, secure in the knowledge that this will not have a deleterious effect
upon the purity of the product.

Table  11.17 
Analysis  of  Variance  for  two  four  factor  experiments  in  Crystal  Purification. 


Source of Variance          Sums of Squares
                             T1           T2

W                          333.06       612.56
S                          217.56         1.56
G                           95.06       248.06
D                           45.56      2047.56
W × S                      203.07         0.57
S × G                      333.07        22.57
G × D                       10.57       150.07
D × S                       76.57        95.07
W × D                      637.57       564.07
G × W                      280.57       138.07
W × S × G                   45.55       175.55
G × S × D                   85.55         5.08
S × D × W                  430.55        10.55
W × G × D                   27.55         7.55
Residual                   162.58        76.55

Total                     2984.44      4155.44

Reverting to Table 11.16, it will be noted that the T main effect, when tested
against W × T × D (all the first order interactions involving T are non-significant),
is significant at the 5% level, nearly at the 1% level. The mean for T1 is lower
than that for T2, and thus for purity we should choose T1. However, since the charge
is only half of that for T2, the throughput of material through the plant will be
only half, and it may be necessary therefore to use T2 if at all possible.

Reverting  to  Table  11.17,  and  examining  the  analysis  for  T2,  we  find  W  X  D 
is  at  the  5%  level  of  significance,  and  accordingly  separate  analyses  were  made 
for  each  level  of  D,  as  in  Table  11.18. 

Table  11.18 


Source of Variance          Sums of Squares
                             D1           D2

G                          392.0          6.15
W                            0.5       1176.15
S                           60.5         36.15
G × W                       40.5        105.10
W × S                        8.0          3.10
S × G                       24.5          3.10
Residual                   242.0         10.15

Total                      768.0       1339.90
For the D1 series, all terms are non-significant with the possible exception
of G, the means for G1 and G2 being 146 and 132 respectively. For the D2
series, on the other hand, the only prominent term is W, the means for W1 and
W2 being 173.8 and 149.5 respectively.

We may recall that using T1, with any value of D, W, S and G, the mean
obtained was 118.2, so as far as purity goes we prefer T1. If we use T2, however,
we should prefer D1, when we can use any value of G, W, and S and get an average
purity of 139. Should we need to use D2, then we can use either level of G and S, but
it is important to use W2, giving an average of 149.5.
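
The means quoted in these conclusions can be recovered directly from Table 11.15. The sketch below is an illustration only: the helper `mean_where` and the coding of levels as 1 and 2 are assumptions, and the table is read with rows T1 S1, T1 S2, T2 S1, T2 S2 and columns in the order W, G, D.

```python
from itertools import product

NAMES = ("t", "s", "w", "g", "d")
ROWS = [  # Table 11.15; within a row D varies fastest, then G, then W
    [138, 108, 113, 95, 103, 137, 106, 116],   # T1 S1
    [116, 119, 106, 114, 122, 122, 128, 148],  # T1 S2
    [155, 170, 131, 172, 146, 152, 135, 144],  # T2 S1
    [133, 172, 138, 181, 150, 156, 124, 146],  # T2 S2
]

data = {}
for (t, s), row in zip(product((1, 2), repeat=2), ROWS):
    for (w, g, d), y in zip(product((1, 2), repeat=3), row):
        data[(t, s, w, g, d)] = y


def mean_where(**fixed):
    """Mean of all observations whose levels match, e.g. mean_where(t=2, d=1)."""
    picked = [y for levels, y in data.items()
              if all(levels[NAMES.index(name)] == level
                     for name, level in fixed.items())]
    return sum(picked) / len(picked)


print("T1, any D, W, S, G:", round(mean_where(t=1), 1))
print("T2 with D1:        ", mean_where(t=2, d=1))
print("T2, D1: G1 and G2: ", mean_where(t=2, d=1, g=1), mean_where(t=2, d=1, g=2))
print("T2, D2: W1 and W2: ", mean_where(t=2, d=2, w=1), mean_where(t=2, d=2, w=2))
```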

It  is  clear  that  the  complicated  behaviour  of  this  system  could  not  possibly 
have  been  followed  without  the  use  of  a  factorial  design.  Also,  the  analysis  of 
variance  was  absolutely  necessary  in  order  to  identify  those  effects  and  interactions 
which  were  significant.  It  would  have  been  quite  impossible  to  do  this  by 
qualitative  inspection  of  the  results  in  Table  11.15. (8) 

(g)  Incomplete Two-Factor Analysis: One Factor with Replication

In  our  two  factor  analysis  (Section  (c)),  we  have  g  grades  of  material  being 
processed  at  t  different  temperatures.  Suppose,  now,  that  we  make  t  batches  all 
at  the  same  temperature  from  each  of  the  g  grades.  Under  these  circumstances, 
the  first  unit  of  Grade  1  has  no  more  correspondence  with  the  first  unit  of  Grade  2 
than  it  has  with  the  second  unit  of  Grade  2,  etc. 

The two-factor analysis has thus become one factor (Grades) with t
replications. This amounts to pooling the degrees of freedom and the sums of squares
of the T main effect in Table 11.1 with the Residual.  This gives us Table 11.19:

Table  11.19 


Source of Variance        Degrees of Freedom               Components of Variance

G                         g-1                              $\sigma_0^2 + t\sigma_g^2$
Residual (within G's)     (t-1) + (t-1)(g-1) = g(t-1)      $\sigma_0^2$

Total                     gt-1

It  is  this  type  of  analysis  which  was  discussed  in  detail  in  Chapter  VII 
Section  (c). 


(h)  Incomplete  Three-Factor  Analysis  :  Two  Factors  with  Replication 

(i)  In  our  complete  three  factor  analysis  we  had  s  suppliers  each  supplying 
g  grades  of  polymer  which  were  processed  at  each  of  the  t  temperatures.  Let 
us  now  assume  that  t  batches  were  processed  at  the  same  temperature,  i.e.  we 
have  two  factors  G  and  S,  with  t  replications.  Considering  Table  11.2,  we  have 
that  the  G  X  T  and  T  X  S  interactions  and  the  T  main  effect  all  do  not  exist. 
The  reason  for  the  interactions  mentioned  disappearing  is  that,  for  example,  the 
G  main  effect  is  by  definition  the  same  for  all  the  replications,  and  any  irregularity 
in  it  is  synonymous  with  error,  for  which  we  use  the  residual.  We  then  obtain 
Table  11.20. 

Table 11.20

Source of Variance    Degrees of Freedom                       Components of Variance

G                     (g-1)                                    $\sigma_0^2 + t\sigma_{gs}^2 + ts\sigma_g^2$
S                     (s-1)                                    $\sigma_0^2 + t\sigma_{gs}^2 + tg\sigma_s^2$
G × S                 (g-1)(s-1)                               $\sigma_0^2 + t\sigma_{gs}^2$
Residual              (g-1)(s-1)(t-1) + (t-1) + (g-1)(t-1)     $\sigma_0^2$
                      + (s-1)(t-1) = gs(t-1)

Total                 gst-1

Consider the problem in which we wish to investigate a factor in a process,
in which, say, we wish to determine its effect at three levels, F1, F2, and F3.
Suppose that the material at one stage passes through a particular process which
we know or suspect may give rise to variability. Then it will be necessary to
eliminate the errors due to this other factor.

(8)  An example of a seven factor experiment on a factory process will be found in R. L.
Cunningham and J. Ansel Anderson: "Cereal Chemistry," Vol. XX, page 482, 1943.

The experimental design below would satisfy the requirements. A
comparison of the means of the Fs is valid since the disturbing effect of the machines
M1, M2 and M3 is eliminated. The same experiment also gives us a valid
comparison of the three machines, M1, M2 and M3. The data from such an experiment
would be analysed by the analysis of variance by rows and columns, previously
discussed in Chapter VII (e) and Section (c) of the present chapter.


                 Machine

           M1      M2      M3

F1
F2
F3

This  experimental  design  has  been  presented  as  a  means  of  eliminating  errors 
due  to  a  particular  factor,  but  of  course  it  is  exactly  the  same  as  for  a  two  factor 
experiment,  i.e.  we  might  be  equally  interested  in  both  the  factors. 

As the design stands we cannot determine whether there is any interaction
between the terms, for though we can compare, say, (M1 - M2) for F = F1 with
(M1 - M2) for F = F3, our comparison is pointless because we have no estimate
of the error of either term and hence of their difference. Without the estimate of
the error we cannot estimate the significance of any apparent difference.

We  would  get  an  estimate  of  error  if  we  had  repeated  each  experiment,  when 
the  design  would  be  called  a  row  and  column  or  two  factor  analysis  with  replication. 

Consider  the  data  in  the  table  below. 


               N1             N2          Totals

D1          -12   -5       27   27           37
D2          -29  -22       34   24            7

Totals         -68            112            44

It refers to a two factor experiment N × D, each factor at two levels, and
there are two observations for each point.  We proceed as follows:

(1)  Form the total of each cell, square these totals, sum these squares, and
divide by the number of original individuals per cell, i.e.

$[(-12 - 5)^2 + (-29 - 22)^2 + (27 + 27)^2 + (34 + 24)^2]/2 = 4585$

(2)  Form the total for each level of D, square these totals, sum these squares,
and divide by the number of individuals per total being squared, i.e.

$(37^2 + 7^2)/4 = 354.5$

(3)  Similarly for each level of N, i.e.

$[(-68)^2 + 112^2]/4 = 4292$

(4)  Form the grand total, square it and divide by the grand number of
individuals, i.e.

$44^2/8 = 242$

(5)  Square every individual and sum these squares, i.e.

$(-12)^2 + (-5)^2 + (27)^2 + \ldots + 24^2 = 4684$

The table of the analysis of variance is as below:


Source of Variance       Sums of Squares        Degrees of Freedom        Mean Squares    Components of Variance

Between Rows (D)         (2) - (4) = 112.5      n1 - 1 = 1                  112.5         $n_2 n_3\sigma_1^2 + n_3\sigma_{12}^2 + \sigma_0^2$
Between Columns (N)      (3) - (4) = 4050.0     n2 - 1 = 1                 4050.0         $n_1 n_3\sigma_2^2 + n_3\sigma_{12}^2 + \sigma_0^2$
Row × Column             180.5                  (n1 - 1)(n2 - 1) = 1        180.5         $n_3\sigma_{12}^2 + \sigma_0^2$
interaction (D × N)
Residual                 (5) - (1) = 99.0       n1 n2 (n3 - 1) = 4           24.75        $\sigma_0^2$

Total                    (5) - (4) = 4442.0     n1 n2 n3 - 1 = 7

n1 rows, n2 columns, n3 replications.

$\sigma_1^2$ = variance due to differences between rows.

$\sigma_2^2$ = variance due to differences between columns.

$\sigma_{12}^2$ = row × column interaction.

$\sigma_0^2$ = residual error.

The  Sum  of  Squares  for  the  Row  x  Column  Interaction  is  obtained  by 
subtracting  the  total  of  the  Sums  of  Squares  for  the  other  three  components  from 
the  Total  Sum  of  Squares,  (5)  —  (4). 

In general, in testing the mean squares for significance, the row mean square
and the column mean square are tested against the row × column interaction,
for if they are significantly greater than the interaction, this can only be so if $\sigma_1^2$
and $\sigma_2^2$ respectively are greater than zero. Similarly the existence of the interaction
can be checked by comparing the interaction mean square with the residual.

Here we test the Interaction against the Residual, getting a variance ratio
of 180.5/24.75 = 7.3 for degrees of freedom n1 = 1, n2 = 4, this lying close to
the 5% level of significance.
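
The five terms and the resulting table can be retraced with a short Python sketch. It follows steps (1) to (5) as given; the helper `sum_sq_of_totals` and the dictionary layout are illustrative assumptions, not part of the text.

```python
# Observations from the N x D table: two replicates in each cell.
cells = {
    ("D1", "N1"): [-12, -5],
    ("D1", "N2"): [27, 27],
    ("D2", "N1"): [-29, -22],
    ("D2", "N2"): [34, 24],
}
n_rows, n_cols, n_reps = 2, 2, 2


def sum_sq_of_totals(group_of):
    """Sum of squared totals after grouping the cells by group_of(key)."""
    totals = {}
    for key, values in cells.items():
        group = group_of(key)
        totals[group] = totals.get(group, 0) + sum(values)
    return sum(total ** 2 for total in totals.values())


term1 = sum_sq_of_totals(lambda key: key) / n_reps                 # cells
term2 = sum_sq_of_totals(lambda key: key[0]) / (n_cols * n_reps)   # rows (D)
term3 = sum_sq_of_totals(lambda key: key[1]) / (n_rows * n_reps)   # columns (N)
term4 = sum_sq_of_totals(lambda key: None) / (n_rows * n_cols * n_reps)
term5 = sum(x ** 2 for values in cells.values() for x in values)

ss_rows = term2 - term4
ss_cols = term3 - term4
ss_residual = term5 - term1
ss_total = term5 - term4
# Interaction by difference, as described for the table above.
ss_interaction = ss_total - ss_rows - ss_cols - ss_residual

df_interaction = (n_rows - 1) * (n_cols - 1)
df_residual = n_rows * n_cols * (n_reps - 1)
variance_ratio = (ss_interaction / df_interaction) / (ss_residual / df_residual)

print(term1, term2, term3, term4, term5)
print(ss_rows, ss_cols, ss_interaction, ss_residual, ss_total)
print(round(variance_ratio, 1))
```

The interaction obtained by difference is the same as (1) - (2) - (3) + (4).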

Accordingly we should conclude that D and N interact, and we should inspect
the data broken down into one factor experiments. We can compare D1 with
D2 at each level of N, using the Student t test to compare the two means for
D. We find that the difference between D1 and D2 is not significant for N2
but is quite considerable for N1. Similarly we can compare the difference between
N1 and N2 for each level of D, when we find that the difference between the two
levels of N is greater for D2 than for D1.

It  will  be  noted  that  this  two  factor  analysis  with  replication  is  really  a  three 
factor  analysis  with  all  the  terms  involving  the  third  factor  R,  i.e.  the  R  main 
effect  and  the  R  X  N  and  R  X  D  interactions,  pooled  with  the  residual. 
(i)  Doubly Incomplete Three Factor Analysis: One Factor with
Double Order Replication

(i)  In the previous example, we had s suppliers each supplying g different
grades of polymer, and from these t batches were processed, all at the same
temperature. Now suppose that the s suppliers, instead of each supplying g different
grades of polymer, supplied g nominally identical consignments of polymer, all
of the same grade.  The system then has the structure given by the sketch below:


Supplier 1                          Supplier 2              Supplier 3

Consignments  1   2   3   4 ...     1'   2'   3' ...        1"   2"   3" ...

Batches (shown for consignment 1 of Supplier 1 only)   1   2   3 ...

Each consignment is made up of a number of batches; these are only indicated
for consignment 1 of supplier 1, but should be understood for all the consignments.
Since the consignments are nominally of similar grade, there is no point in
comparing the mean of all the 1s (1, 1', 1" etc.) with the mean of all the 2s (2, 2', 2"
etc.). The different consignments 1, 2, 3, etc. from Supplier 1 may differ among
themselves, however, and similarly for the other suppliers. We can use the
consistency of the batches processed from each consignment as a measure of
error against which the difference between consignments can be compared. It
may be noted that should the first batch in all instances be processed at
temperature t1, the second batch in all instances at another temperature t2, etc., then
we have an incomplete two factor analysis as in the previous section, the factors
being supplier (S) and temperature (R).

The variance can then be analysed into the components given in Table 11.21.
Table 11.21 is derived from Table 11.20 by pooling the sums of squares and degrees
of freedom of the G and G × S terms. The component of variance corresponding
to the G effect in Table 11.21 is more of the nature of an interaction than a main
effect, but we will symbolically use G (rather than g) as a suffix to the component
of variance to denote its rather different character.

Table  11.21 


Source of Variance            Degrees of Freedom                Components of Variance

Suppliers (S)                 (s-1)                             $\sigma_0^2 + t\sigma_G^2 + tg\sigma_s^2$
Between Consignments (G)      (g-1) + (g-1)(s-1) = s(g-1)       $\sigma_0^2 + t\sigma_G^2$
Within consignments           gs(t-1)                           $\sigma_0^2$
(Residual)

Total                         gst-1

(ii)  A certain type of plant for concentrating dilute acid was subject to failure
through corrosion. The figures below in Table 11.22 represent the quantity of acid
obtained from a number of units before failure. From the mass of data 7 units
were selected from each of 9 factories, and the 9 factories fell naturally into three
groups according to the type of weak acid they were concentrating.

Table 11.22

Type of       Factory      Throughputs                                Totals
Weak Acid

A             1             40    49    25    30    32    25    22      223
              2             40    46    41    30    50    39    31      277
              3             38    38    32    33    42    45    44      272

B             4             16    50    40    54    80    64    59      363
              5             49    35    70    64    23    70    88      399
              6             68    20    66    36    34    45    40      309

C             7             90    90   104    62    78    88    93      605
              8             84    93    73    80   210   137    89      766
              9            152    70   122   143    72   122   113      794

To analyse this data we form the following terms:

(1)  Square every individual and sum all the squares, i.e.
$40^2 + 49^2 + 25^2 + \ldots + 122^2 + 113^2 = 339,900$

(2)  Obtain the total for each block (factory), square these totals, sum these
squares, and divide by the number of individuals in each block, i.e.
$(223^2 + 277^2 + \ldots + 794^2)/7 = 310,016$

(3)  Obtain the total for each super-block (type of weak acid), square these
totals, sum these squares and divide by the number of individuals in each
super-block, i.e.
$[(223 + 277 + 272)^2 + (363 + 399 + 309)^2 + (605 + 766 + 794)^2]/21 = 306,202$

(4)  Obtain the grand total for all individuals, square it, and divide by the
grand number of individuals, i.e.
$(223 + 277 + 272 + 363 + \ldots + 794)^2/63 = 254,985$

From  these  terms  we  form  the  analysis  of  variance  below  in  Table  11.23. 


Table  11.23 


Source of Variance              Degrees of Freedom        Sums of Squares         Mean Squares    Components of Variance

Between super-blocks            n1 - 1 = 2                (3) - (4) = 51,217        25,609        $n_2 n_3\sigma_1^2 + n_3\sigma_2^2 + \sigma_3^2$
(type of weak acid)
Between blocks (Factories)      n1(n2 - 1) = 6            (2) - (3) = 3,814            636        $n_3\sigma_2^2 + \sigma_3^2$
within super-blocks
Residual: Within blocks         n1 n2 (n3 - 1) = 54       (1) - (2) = 29,884           553        $\sigma_3^2$
(within factories)

Total                           n1 n2 n3 - 1 = 62         (1) - (4) = 84,915

n1 = number of super-blocks.

n2 = number of blocks within a super-block.

n3 = number of individuals within a block.

$\sigma_1^2$ = variance due to differences between super-blocks.

$\sigma_2^2$ = variance due to differences between blocks within super-blocks.

$\sigma_3^2$ = variance due to differences between individuals within blocks.

From the last column, "Components of Variance," we see that $\sigma_2^2$ can be greater
than zero only if $n_3\sigma_2^2 + \sigma_3^2$ is significantly greater than $\sigma_3^2$. Thus we compare
the Between Blocks Mean Square with the Within Blocks Mean Square. Similarly,
in general we test the Super-blocks Mean Square against the Between Blocks
(within Super-blocks) Mean Square, for if it is significantly greater than the
latter this can only be so because $\sigma_1^2$ is greater than zero.

Here we see that the Between Blocks Mean Square is not significantly greater
than the Residual, so we can assume that $\sigma_2^2 = 0$ and pool the two sets of Sums
of Squares and Degrees of Freedom to get a new Residual of 33,698/60 = 561.6
with 60 degrees of freedom.

Testing the Between Super-blocks Mean Square against this residual we
have a variance ratio of 25,609/561.6 = 45.6 for degrees of freedom n1 = 2,
n2 = 60. For these degrees of freedom and the 0.1% level of significance the
variance ratio needs to be not less than 7.8. It follows that the Between
Super-blocks Mean Square is very much more significant than the 0.1% level. Solving
the equations $3 \times 7\,\sigma_1^2 + \sigma_3^2 = 25,609$, $\sigma_3^2 = 562$ (we have shown $\sigma_2^2 = 0$), we
obtain the value 1192 for $\sigma_1^2$.  We thus get

$\sigma_1^2 = 1192$
$\sigma_2^2 = 0$
$\sigma_3^2 = 562$
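
The whole of this nested analysis can be retraced from Table 11.22 with the sketch below. The dictionary layout and variable names are assumptions made for illustration. Because the sketch carries unrounded figures throughout, it gives about 1192.7 for the first component, where the text, working from rounded mean squares, quotes 1192.

```python
# Throughputs from Table 11.22: acid type -> factory -> seven units.
THROUGHPUTS = {
    "A": {1: [40, 49, 25, 30, 32, 25, 22],
          2: [40, 46, 41, 30, 50, 39, 31],
          3: [38, 38, 32, 33, 42, 45, 44]},
    "B": {4: [16, 50, 40, 54, 80, 64, 59],
          5: [49, 35, 70, 64, 23, 70, 88],
          6: [68, 20, 66, 36, 34, 45, 40]},
    "C": {7: [90, 90, 104, 62, 78, 88, 93],
          8: [84, 93, 73, 80, 210, 137, 89],
          9: [152, 70, 122, 143, 72, 122, 113]},
}
n1, n2, n3 = 3, 3, 7   # super-blocks, blocks per super-block, units per block

block_totals = [sum(units) for factories in THROUGHPUTS.values()
                for units in factories.values()]
super_totals = [sum(sum(units) for units in factories.values())
                for factories in THROUGHPUTS.values()]

term1 = sum(x * x for factories in THROUGHPUTS.values()
            for units in factories.values() for x in units)
term2 = sum(b * b for b in block_totals) / n3
term3 = sum(s * s for s in super_totals) / (n2 * n3)
term4 = sum(block_totals) ** 2 / (n1 * n2 * n3)

ss_super, df_super = term3 - term4, n1 - 1
ss_blocks, df_blocks = term2 - term3, n1 * (n2 - 1)
ss_within, df_within = term1 - term2, n1 * n2 * (n3 - 1)

ms_super = ss_super / df_super
ms_blocks = ss_blocks / df_blocks
ms_within = ss_within / df_within

# Blocks are not significant, so pool them with the within-block residual.
pooled_ms = (ss_blocks + ss_within) / (df_blocks + df_within)
variance_ratio = ms_super / pooled_ms
sigma1_sq = (ms_super - pooled_ms) / (n2 * n3)

print(round(ms_super), round(ms_blocks), round(ms_within))
print(round(pooled_ms, 1), round(variance_ratio, 1), round(sigma1_sq, 1))
```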

The  value  of  this  analysis  is  that  it  shows  that  differences  between  factories 
within  an  acid  group  are  not  detectable,  and  that  the  larger  part  of  the  variability 
in  throughput  achieved  is  due  to  differences  between  acid  groups. 

The use of the first point is that it enables us to pool data for all factories within
an acid group for any detailed comparison: e.g., for comparisons between
foundries, in some instances there are 10 pots of foundry A and 1 pot of foundry
B at one factory, 1 pot of foundry A and 10 pots from foundry C at a second
factory, and so on. So long as we are confined to comparisons within a factory,
our estimates of foundry differences will be very inaccurate. Having proved that
we can pool the data for all factories within an acid group, our comparisons will
be much more useful.

The  use  of  the  second  point  is  that  it  throws  great  emphasis  upon  the 
importance  of  the  type  of  acid  being  concentrated.  It  can  only  be  followed  up 
using  orthodox  chemical  considerations,  of  course. 

(j)  Incomplete  Four-Factor  Analysis  :  Three  Factors  with  Replication 

(i)  In  our  complete  four-factor  analysis  we  had  s  suppliers  working  from 
each  of  r  sources  of  raw  material  and  providing  g  grades  of  polymer  which  were 
worked  at  each  of  t  temperatures.  If,  instead,  we  work  t  batches  all  at  the  same 
temperature,  we  then  have  a  three  factor  analysis  with  replication,  the  three 
factors  being  grade  of  polymer  (G),  supplier  (S)  and  source  of  raw  material  (R). 

We derive the analysis of variance for this situation from the complete four
factor analysis given in Table 11.8.  We have the T main effect, the first order
interactions involving T and the second order interactions involving T, all
non-existent.  We can therefore pool their degrees of freedom and sums of squares
with the residual, and we obtain Table 11.24:

Table  11.24 


Source of Variance    Degrees of Freedom                  Components of Variance

G                     (g-1)                               $\sigma_0^2 + t\sigma_{srg}^2 + ts\sigma_{rg}^2 + rt\sigma_{gs}^2 + tsr\sigma_g^2$
S                     (s-1)                               $\sigma_0^2 + t\sigma_{srg}^2 + gt\sigma_{sr}^2 + rt\sigma_{gs}^2 + gtr\sigma_s^2$
R                     (r-1)                               $\sigma_0^2 + t\sigma_{srg}^2 + gt\sigma_{sr}^2 + ts\sigma_{rg}^2 + gts\sigma_r^2$
G × S                 (g-1)(s-1)                          $\sigma_0^2 + t\sigma_{srg}^2 + rt\sigma_{gs}^2$
S × R                 (s-1)(r-1)                          $\sigma_0^2 + t\sigma_{srg}^2 + gt\sigma_{sr}^2$
R × G                 (r-1)(g-1)                          $\sigma_0^2 + t\sigma_{srg}^2 + ts\sigma_{rg}^2$
G × S × R             (g-1)(s-1)(r-1)                     $\sigma_0^2 + t\sigma_{srg}^2$
Residual              (g-1)(t-1)(r-1)(s-1)                $\sigma_0^2$
                      + (r-1)(g-1)(t-1)
                      + (t-1)(s-1)(r-1)
                      + (g-1)(t-1)(s-1)
                      + (t-1)(r-1) + (t-1)(s-1)
                      + (g-1)(t-1) + (t-1)
                      = gsr(t-1)

Total                 gsrt-1

(ii)  Suppose we carry out a three factor experiment, and replicate each point.
This could be regarded as a four factor experiment, the fourth factor being the
order of replication (A). When the complete analysis of variance is obtained, all
terms involving A are pooled with the residual, viz., if N, D, and R are the three main
factors, we pool A, A × N, A × D, A × R, A × N × D, A × D × R, and
A × R × N.

The  arithmetic  is  lighter,  however,  if  we  total  over  all  levels  of  A,  i.e.  sum 
over  all  replications,  for  each  value  of  N,  D,  and  R,  thus  getting  a  three  factor 
table  in  which  each  figure  is  the  sum  of  the  replications. 

Consider  the  data  in  the  table  below  : — 


                  N1                          N2
            D1          D2              D1          D2
          R1    R2    R1    R2        R1    R2    R1    R2

A1        -3    -1    -8    -6         8     8    10     7
A2        -7    -6    -3    -5        -6     1     0    -3

S(A)     -10    -7   -11   -11         2     9    10     4

The row S(A) is the sum of A1 and A2 for each value of N, D, and R. The
data in this row are subjected to the same three factor analysis previously described,
except for the following differences :

(1) Whenever we sum a set of squares, we divide by the number of replications,
here 2, in addition to the other divisors.

Thus one of our three two-way tables is

              D1      D2      Totals
N1           -17     -22       -39
N2            11      14        25
Totals        -6      -8       -14

The correcting factor is $(-14)^2/(8 \times 2) = 12.25$
The sum of squares for N is
$[(-39)^2 + 25^2]/(4 \times 2) - 12.25 = 256.00$
The sum of squares for D is
$[(-6)^2 + (-8)^2]/(4 \times 2) - 12.25 = 0.25$
The N x D interaction is given by
$[(-17)^2 + (-22)^2 + 11^2 + 14^2]/(2 \times 2) - 12.25 - 256.00 - 0.25 = 4.00$

The  Sums  of  Squares  of  the  R,  R  X  D,  and  R  X  N  effects  are  obtained 
similarly. 

(2) An independent estimate of the Residual is obtained as the difference
between the sum of the squares of all the original individuals and (the sum of
the squares of the values for S(A) divided by the number of replications, here 2),
i.e.

$(-3)^2 + (-1)^2 + \ldots + (-3)^2 - [(-10)^2 + (-7)^2 + \ldots + 4^2]/2 = 256$

(3) The Total Sum of Squares is given by the difference between the sum of
the squares of the original individuals and the correcting factor, i.e.

$(-3)^2 + (-1)^2 + \ldots + (-3)^2 - 12.25 = 539.75$

(4)  The  N  X  D  X  R  Interaction  Sum  of  Squares  is  obtained  as  the  difference 
between  the  Total  Sum  of  Squares  and  the  sum  of  all  the  other  terms. 

The results of this analysis are as below :

Source of      Sums of      Degrees of Freedom        Mean
Variance       Squares                                Squares

N              256.00       n-1 = 1                   256.00
D                0.25       d-1 = 1                     0.25
R                1.00       r-1 = 1                     1.00
N x D            4.00       (n-1)(d-1) = 1              4.00
D x R           16.00       (d-1)(r-1) = 1             16.00
R x N            0.25       (r-1)(n-1) = 1              0.25
N x D x R        6.25       (n-1)(d-1)(r-1) = 1         6.25
Residual       256.00       ndr(a-1) = 8               32.00

Total          539.75       rnda-1 = 15

The N x D x R interaction is tested against the Residual, and is obviously
not significant. In fact, all the terms except the N main effect are not significant
and can be pooled to give a new residual of 283.75/14 = 20.27 : the N main effect
then gives a variance ratio of 256.00/20.27 = 12.63 with degrees of freedom n1 = 1,
n2 = 14, this being more significant than the 1% level of significance. We can
then calculate the means for N1 and N2 as -4.875 and 3.125 respectively.
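
The arithmetic of this example can be reproduced in a few lines. The sketch below is only an illustration of the procedure just described, not the author's own working: the helper `table_ss` and the variable names are assumptions introduced here, and the data are the coded values from the table above.

```python
from itertools import product

# Coded observations in the column order of the data table:
# N1D1R1, N1D1R2, N1D2R1, N1D2R2, N2D1R1, N2D1R2, N2D2R1, N2D2R2.
A1 = [-3, -1, -8, -6, 8, 8, 10, 7]
A2 = [-7, -6, -3, -5, -6, 1, 0, -3]
REPS = 2

levels = list(product(range(2), repeat=3))            # (n, d, r), r varying fastest
S = {lvl: a1 + a2 for lvl, a1, a2 in zip(levels, A1, A2)}   # the row S(A)

n_obs = REPS * len(S)                                  # 16 individuals
cf = sum(S.values()) ** 2 / n_obs                      # correcting factor


def table_ss(axes):
    """Sum of squares of the S(A) table totalled down to the factors in `axes`."""
    totals = {}
    for lvl, value in S.items():
        key = tuple(lvl[i] for i in axes)
        totals[key] = totals.get(key, 0) + value
    individuals_per_total = n_obs / len(totals)        # includes the divisor for replication
    return sum(t * t for t in totals.values()) / individuals_per_total - cf


ss = {"N": table_ss((0,)), "D": table_ss((1,)), "R": table_ss((2,))}
ss["NxD"] = table_ss((0, 1)) - ss["N"] - ss["D"]
ss["DxR"] = table_ss((1, 2)) - ss["D"] - ss["R"]
ss["RxN"] = table_ss((0, 2)) - ss["R"] - ss["N"]

raw = sum(x * x for x in A1 + A2)
total_ss = raw - cf
residual_ss = raw - sum(v * v for v in S.values()) / REPS
ss["NxDxR"] = total_ss - residual_ss - sum(ss.values())   # obtained by difference

# Pool everything except the N main effect (14 degrees of freedom).
pooled_ms = (total_ss - ss["N"]) / (n_obs - 2)
variance_ratio = ss["N"] / pooled_ms

for name, value in ss.items():
    print(f"{name:6s} {value:8.2f}")
print(f"Residual {residual_ss:.2f}  Total {total_ss:.2f}  ratio {variance_ratio:.2f}")
```

Because every divisor is worked out from the number of individuals in each total, the extra division by the number of replications is applied automatically.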

It may be in this example that the second replication (series A2) is at a
different general level from the first (series A1), due to an adjustment of
the plant between the two series, a new batch of raw material, etc. The variance
due to such an effect will be included in the Residual, and if large will give us an
inflated estimate of the Residual, thus lowering the accuracy of our experiment.

Accordingly we may take out from the Residual the Sum of Squares corresponding
to the replication by forming the total for series A1 = 15 and for series
A2 = (-29) and then calculating

$[15^2 + (-29)^2]/8 - 12.25 = 121.00$

Thus the Sum of Squares for the Residual is now 256.00 - 121.00 = 135.00 and
its Degrees of Freedom are 8 - 1 = 7 giving a Mean Square of 19.29.

In  this  case,  therefore,  we  have  gained  considerably  by  taking  out  the  series 
Sum  of  Squares,  and  we  have  increased  the  likelihood  of  detecting  significance 
in  the  Terms  very  considerably. 
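
As a hedged illustration of this step, the sketch below takes the two replication series to be the rows A1 and A2 of the data table (their totals, 15 and -29, agree with the figures quoted) and removes their sum of squares from the Residual. The variable names are introduced here for illustration only.

```python
# Rows A1 and A2 of the data table, i.e. the first and second replication series.
A1 = [-3, -1, -8, -6, 8, 8, 10, 7]
A2 = [-7, -6, -3, -5, -6, 1, 0, -3]

n_obs = len(A1) + len(A2)                       # 16 individuals
cf = (sum(A1) + sum(A2)) ** 2 / n_obs           # correcting factor, 12.25

# Sum of squares between the two series: each series total covers 8 individuals.
series_ss = (sum(A1) ** 2 + sum(A2) ** 2) / len(A1) - cf

residual_ss = 256.00 - series_ss                # Residual from the earlier table
residual_df = 8 - 1
residual_ms = residual_ss / residual_df

print(series_ss, residual_ss, round(residual_ms, 2))
```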


(k) Doubly Incomplete Four Factor Analysis : Two Factors with
Double Order Replication

(i) Let us now suppose that in the previous example the suppliers, instead
of working to produce g different grades, actually produced g nominally identical
grades. We still process t batches from each of these, also at the same temperature
as before.

The variance can now be analysed into between sources of raw material (R),
between suppliers (S), the R x S interaction, and the two further components
shown below in Table 11.25. The degrees of freedom and sums of squares for the
new G term are obtained by pooling those for G with those for the G x S, R x G and
G x S x R terms. The Residual is the same as in Table 11.24.

Table 11.25

Source of Variance       Degrees of Freedom                           Components of Variance

R                        (r-1)                                        $\sigma_0^2 + t\sigma_G^2 + gt\sigma_{rs}^2 + gts\sigma_r^2$
S                        (s-1)                                        $\sigma_0^2 + t\sigma_G^2 + gt\sigma_{rs}^2 + gtr\sigma_s^2$
R x S                    (r-1)(s-1)                                   $\sigma_0^2 + t\sigma_G^2 + gt\sigma_{rs}^2$
Between G                (g-1) + (g-1)(r-1) + (g-1)(s-1)              $\sigma_0^2 + t\sigma_G^2$
Within R and S           + (g-1)(r-1)(s-1) = rs(g-1)
Within G (Residual)      gsr(t-1)                                     $\sigma_0^2$

Total                    gsrt-1

(ii) The purpose of this experiment was to determine whether the position of a
scroll of nitrated paper in a stabilising vat had any effect upon its nitrogen content.
Two scrolls (S) were stabilised in each of two positions (P) and samples were
taken from the top and bottom of each scroll (F). Duplicate determinations of
the nitrogen content were performed on each of these eight samples.
At first sight this would appear to fall in the class of three factors (P x S x F)
with replication, the class which we discussed in the previous section.


Position      Scroll      Fraction      % Nitrogen
of Scroll     Number      of Scroll

P1            S1          F1            12.20,  12.20
                          F2            12.26,  12.26
              S2          F1            12.10,  12.10
                          F2            12.39,  12.38
P2            S'1         F1            12.31,  12.30
                          F2            12.44,  12.44
              S'2         F1            12.23,  12.21
                          F2            12.37,  12.35

However, a moment's reflection will show that there is no true one-to-one
correspondence between S1 and S'1 or between S2 and S'2. S1 has no more
association with S'1 than with S'2. It is thus evident that the factor S is really
a form of replication.

The  second  point  is  that  the  duplication  of  analysis  does  not  constitute  a 
genuine  replication,  i.e.  a  genuine  repeat  of  the  whole  experiment  (to  perform 
the  latter  we  should  have  to  use  different  scrolls  in  a  different  vat). 

To  carry  out  the  analysis  of  variance  we  might  carry  out  the  full  analysis, 
and  then  pool  together  the  degrees  of  freedom  and  sums  of  squares  on  the  lines 
indicated  previously.  However,  it  is  less  trouble  to  evaluate  the  relevant  terms 
directly.  In  the  calculation  below  the  zero  used  is  12.20  and  the  scale  has  been 
multiplied  by  100. 

(1) We need the square of the grand total divided by the number of individuals
in that total (the usual correcting factor) :

$134^2/16 = 1122.25$

(2) We need the two-way table for P and F.

              F1      F2      Totals
P1           -20      49        29
P2            25      80       105
Totals         5     129       134

From this table the P, F, and P x F sums of squares are obtained in the
usual manner :

P = $(29^2 + 105^2)/8 - 1122.25 = 361.00$
F = $(5^2 + 129^2)/8 - 1122.25 = 961.00$
P x F = $[(-20)^2 + 49^2 + 25^2 + 80^2]/4 - 1122.25 - 361.00 - 961.00 = 12.25$

(3) The sum of squares between analyses is

$(0^2 + 0^2 + 6^2 + \ldots + 15^2) - [(0 + 0)^2 + (6 + 6)^2 + \ldots]/2 = 5.00$

(4) The total sum of squares is

$(0^2 + 0^2 + 6^2 + \ldots + 15^2) - 1122.25 = 1731.75$

(5) The sum of squares for Error is obtained by difference.
The resulting table is below :


Sources of Variance                 Sums of      Degrees of     Mean
                                    Squares      Freedom        Square

Between position in vat (P)          361.00          1          361.00
Between fraction of scroll (F)       961.00          1          961.00
F x P                                 12.25          1           12.25
Error                                392.50          4           98.12
Between analyses                       5.00          8            0.62

Total                               1731.75         15

The  F  X  P  interaction,  tested  against  the  Error  is  clearly  non-significant ; 
we  can  pool  it  with  the  Error.  The  P  main  effect,  the  principal  object  of  the 
experiment,  can  then  be  tested  against  this  Error.  The  F  main  effect  is  not  of 
direct  interest,  but  can  also  be  tested  against  the  Error.  It  was  advisable  to  have 
included  F  in  the  experiment,  however,  as  it  was  quite  possible  that  P  would 
interact  with  it. 
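
The whole calculation for the scroll experiment can be sketched as follows. This is an illustrative reconstruction under the coding stated above (zero at 12.20, scale multiplied by 100); the dictionary layout and the helper `table_ss` are assumptions made for the example, with the scroll index treated purely as a label within each position.

```python
# Duplicate % nitrogen determinations keyed by (position, scroll, fraction).
raw = {
    (0, 0, 0): (12.20, 12.20), (0, 0, 1): (12.26, 12.26),
    (0, 1, 0): (12.10, 12.10), (0, 1, 1): (12.39, 12.38),
    (1, 0, 0): (12.31, 12.30), (1, 0, 1): (12.44, 12.44),
    (1, 1, 0): (12.23, 12.21), (1, 1, 1): (12.37, 12.35),
}
# Code the data: subtract 12.20 and multiply by 100 (rounded to whole numbers).
coded = {k: tuple(round((x - 12.20) * 100) for x in v) for k, v in raw.items()}

values = [x for pair in coded.values() for x in pair]
n_obs = len(values)                                   # 16
cf = sum(values) ** 2 / n_obs                         # correcting factor


def table_ss(axes):
    """Sum of squares of the table of totals over the key positions in `axes`."""
    totals = {}
    for key, pair in coded.items():
        k = tuple(key[i] for i in axes)
        totals[k] = totals.get(k, 0) + sum(pair)
    return sum(t * t for t in totals.values()) / (n_obs / len(totals)) - cf


ss_p = table_ss((0,))
ss_f = table_ss((2,))
ss_pf = table_ss((0, 2)) - ss_p - ss_f

raw_ss = sum(x * x for x in values)
ss_analyses = raw_ss - sum(sum(pair) ** 2 for pair in coded.values()) / 2
ss_total = raw_ss - cf
ss_error = ss_total - ss_p - ss_f - ss_pf - ss_analyses   # by difference

# Pool the non-significant F x P term with the Error (4 + 1 degrees of freedom).
pooled_error_ms = (ss_error + ss_pf) / 5
print(ss_p, ss_f, ss_pf, ss_error, ss_analyses, ss_total)
print("P ratio:", ss_p / pooled_error_ms, " F ratio:", ss_f / pooled_error_ms)
```

The two printed ratios are the variance ratios for P and F against the pooled Error, each with degrees of freedom 1 and 5.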

(l) Trebly Incomplete Four Factor Analysis : One Factor with Triple
Order Replication

(i)  In  the  previous  example,  the  Factor  R  represented  source  of  raw  material. 
Suppose,  on  the  contrary,  that  it  represented  operator  in  the  factory.  Then 
obviously,  there  is  no  point  in  comparing  the  first  operator  in  all  the  factories 
with  the  second  operator  in  all  the  factories,  and  we  are  only  interested  in  this 
respect  in  the  variation  between  the  operators  within  a  factory. 

The  sums  of  squares  and  degrees  of  freedom  for  the  new  R  term  are  given 
by  pooling  them  for  R  and  R  x  S  in  Table  11.25,  thus  giving  us  Table  11.26. 

Table 11.26

Source of                 Degrees of                      Components of
Variance                  Freedom                         Variance

S                         (s-1)                           $\sigma_0^2 + t\sigma_G^2 + gt\sigma_R^2 + gtr\sigma_s^2$
Between R within S        (r-1) + (r-1)(s-1) = s(r-1)     $\sigma_0^2 + t\sigma_G^2 + gt\sigma_R^2$
Between G within R        rs(g-1)                         $\sigma_0^2 + t\sigma_G^2$
Within G (Residual)       gsr(t-1)                        $\sigma_0^2$

Total                     gsrt-1

The   system   has   the   structure   indicated   below,   where  there   are   similar 
assemblies  for  subsequent  super-blocks. 

                          Super-block 1
               _________________|_________________
              |                                   |
           Block 1                             Block 2
         _____|______                       ______|______
        |            |                     |             |
   Sub-block 1  Sub-block 2           Sub-block 3   Sub-block 4
      |    |       |    |                |    |        |    |
      I1   I2      I3   I4

The number into which the larger units are split can of course be anything from
2 upwards.

(ii) Consider the data in the table below. We have an allegedly homogeneous
batch of material contained in vats V1 to V6. The contents of each vat are wrung
in a centrifuge and bagged. We select two bags B1 and B2 from each vatload
and take two samples S1 and S2 from each bag. On each sample we perform
an analysis for percentage of an ingredient, in duplicate, A1 and A2.


Vat     V1              V2              V3              V4              V5              V6
Bag     B1      B2      B1      B2      B1      B2      B1      B2      B1      B2      B1      B2
Sample  S1  S2  S1  S2  S1  S2  S1  S2  S1  S2  S1  S2  S1  S2  S1  S2  S1  S2  S1  S2  S1  S2  S1  S2

A1      29  28  29  27  29  27  26  24  32  29  25  30  29  30  28  30  30  27  25  26  29  31  29  29
A2      29  27  29  28  29  28  27  25  30  30  27  31  29  31  28  28  29  27  28  26  31  32  30  31

In carrying out the analysis of variance it lightens the arithmetic to take an
arbitrary zero, e.g. by subtracting 20 from every observation. We form the
following terms :

(1) Square every individual and sum the squares, i.e.
$9^2 + 8^2 + 9^2 + 7^2 + \ldots + 10^2 + 11^2 = 3636$

(2) Obtain the total for each sub-block, square the sub-block totals, sum
these squares, and divide by the number of individuals per sub-block, i.e.
$[(9 + 9)^2 + (8 + 7)^2 + (9 + 9)^2 + \ldots + (9 + 10)^2 + (9 + 11)^2]/2 = 3616$

(3) Obtain the totals for each block, square the block totals, sum these squares,
and divide by the number of individuals per block, i.e.
$[(9 + 9 + 8 + 7)^2 + (9 + 7 + 9 + 8)^2 + \ldots + (9 + 10 + 9 + 11)^2]/4 = 3570.5$

(4) Obtain the totals for each super-block, square the super-block totals,
sum these squares, and divide by the number of individuals in each
super-block, i.e.
$(66^2 + 55^2 + 74^2 + \ldots + 82^2)/8 = 3534.25$

(5) Obtain the grand total for all individuals, square it, and divide by the
grand number of individuals, i.e.
$408^2/48 = 3468$

From these terms we form the analysis of variance in Table 11.27 below.

Table 11.27

Source of Variance                   Degrees of            Sum of               Mean       Components of Variance
                                     Freedom               Squares              Squares

Between Super-blocks                 n1 - 1 = 5            (4) - (5) = 66.25    13.25      $n_2 n_3 n_4 \sigma_1^2 + n_3 n_4 \sigma_2^2 + n_4 \sigma_3^2 + \sigma_4^2$
  (Vats)
Between Blocks within                n1(n2 - 1) = 6        (3) - (4) = 36.25     6.04      $n_3 n_4 \sigma_2^2 + n_4 \sigma_3^2 + \sigma_4^2$
  Super-blocks
  (Bags within Vats)
Between Sub-blocks within            n1n2(n3 - 1) = 12     (2) - (3) = 45.50     3.79      $n_4 \sigma_3^2 + \sigma_4^2$
  Blocks
  (Samples within Bags)
Between Individuals within           n1n2n3(n4 - 1) = 24   (1) - (2) = 20.00     0.833     $\sigma_4^2$
  Sub-blocks
  (Analyses within Samples)

Total                                n1n2n3n4 - 1 = 47     (1) - (5) = 168.00

n1 = number of super-blocks = 6
n2 = number of blocks within a super-block = 2
n3 = number of sub-blocks within a block = 2
n4 = number of individuals within a sub-block = 2
$\sigma_1^2$, $\sigma_2^2$, $\sigma_3^2$, $\sigma_4^2$ are respectively the variances due to the four terms listed
under the heading "Source of Variance."
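
The five terms and the sums of squares of Table 11.27 can be reproduced with the short sketch below. It is an illustration only; the helper `grouped_term` is a name introduced here. One assumption should be noted: the A1 reading for vat 4, bag 2, sample 2 is taken as 30, which is the value implied by the printed totals (grand total 408, term (1) = 3636, and so on).

```python
# Analyses in table order: vat, then bag, then sample (24 samples in all).
A1 = [29, 28, 29, 27, 29, 27, 26, 24, 32, 29, 25, 30,
      29, 30, 28, 30, 30, 27, 25, 26, 29, 31, 29, 29]
A2 = [29, 27, 29, 28, 29, 28, 27, 25, 30, 30, 27, 31,
      29, 31, 28, 28, 29, 27, 28, 26, 31, 32, 30, 31]

ZERO = 20                                   # arbitrary zero to lighten the arithmetic
pairs = [(a - ZERO, b - ZERO) for a, b in zip(A1, A2)]


def grouped_term(samples_per_group):
    """Sum of squared group totals divided by the individuals per group.

    Groups are runs of consecutive samples: 1 = sub-block, 2 = block (bag),
    4 = super-block (vat), 24 = the whole experiment.
    """
    term = 0
    for i in range(0, len(pairs), samples_per_group):
        total = sum(a + b for a, b in pairs[i:i + samples_per_group])
        term += total ** 2
    return term / (2 * samples_per_group)   # two analyses per sample


term1 = sum(a * a + b * b for a, b in pairs)
term2 = grouped_term(1)
term3 = grouped_term(2)
term4 = grouped_term(4)
term5 = grouped_term(24)

sums_of_squares = {
    "vats": term4 - term5,
    "bags within vats": term3 - term4,
    "samples within bags": term2 - term3,
    "analyses within samples": term1 - term2,
}
degrees_of_freedom = {"vats": 5, "bags within vats": 6,
                      "samples within bags": 12, "analyses within samples": 24}

for name, ss in sums_of_squares.items():
    df = degrees_of_freedom[name]
    print(f"{name:26s} df={df:2d}  SS={ss:6.2f}  MS={ss / df:6.3f}")
```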
Proceeding to the analysis of Table 11.27, the Between Samples within Bags
Mean Square, tested against the Between Analyses within Samples Mean Square
(to test for the existence of $\sigma_3^2$, see the Components of Variance), gives a
variance ratio of 3.79/0.833 = 4.6 for degrees of freedom n1 = 12, n2 = 24 :
this is more significant than the 0.1% level of significance.

To test for the existence of $\sigma_2^2$, we test the Between Bags within Vats Mean
Square against the Between Samples within Bags Mean Square. The variance
ratio is 6.04/3.79 = 1.6 for degrees of freedom n1 = 6, n2 = 12. This is less
significant than the 20% level, so we can take it that $\sigma_2^2$ does not exist. We
pool the two terms to get a new estimate of $(n_4\sigma_3^2 + \sigma_4^2)$ of 81.75/18 = 4.54
with 18 degrees of freedom.

To test for the existence of $\sigma_1^2$, we test the Between Vats Mean Square
against this pooled Mean Square. We have shown that
$\sigma_2^2$ does not exist, so we are testing $(n_2 n_3 n_4\sigma_1^2 + n_4\sigma_3^2 + \sigma_4^2)$ against
$(n_4\sigma_3^2 + \sigma_4^2)$. The variance ratio is 13.25/4.54 = 2.92 with degrees of freedom n1 = 5,
n2 = 18. This is more significant than the 5% level of significance.

We can now calculate $\sigma_1^2$ as (13.25 - 4.54)/8 = 1.09 and $\sigma_3^2$ as
(4.54 - 0.83)/2 = 1.85.

Our final conclusion thus is that the variability can be accounted for by the
terms

$\sigma_1^2 = 1.09$
$\sigma_2^2 = 0.00$
$\sigma_3^2 = 1.85$
$\sigma_4^2 = 0.83$

We see that there is an appreciable variability between vats, the variability between
bags from a given vat is not large enough to be detected, the variability between
samples within a given bag is quite large, in fact the most important component,
and the variability between duplicate analyses on a given sample is quite appreciable.
The total variance of a single analysis is thus (1.09 + 0.00 + 1.85 + 0.83)
= 3.77.

It is further clear from this analysis of variance that the allegedly homogeneous
batch is significantly inhomogeneous, and that the larger part of this inhomogeneity
arises in bagging the material ($\sigma_3^2 = 1.85$). Each vat is apparently
homogeneous ($\sigma_2^2 = 0.00$), but there is an appreciable difference between vats
($\sigma_1^2 = 1.09$).
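
The sequence of tests and the resulting components of variance can be sketched as below, starting from the sums of squares and degrees of freedom of Table 11.27. The variable names are illustrative assumptions. Because the sketch carries full precision throughout, the last digit of a ratio or of the total may differ slightly from the rounded figures in the text.

```python
n2, n3, n4 = 2, 2, 2        # bags per vat, samples per bag, analyses per sample

ss = {"vats": 66.25, "bags": 36.25, "samples": 45.50, "analyses": 20.00}
df = {"vats": 5, "bags": 6, "samples": 12, "analyses": 24}
ms = {k: ss[k] / df[k] for k in ss}

# Each mean square is tested against the one immediately below it.
ratio_samples = ms["samples"] / ms["analyses"]     # tests sigma_3^2
ratio_bags = ms["bags"] / ms["samples"]            # tests sigma_2^2

# Bags are not significant, so bags and samples are pooled (18 degrees of freedom).
pooled_ms = (ss["bags"] + ss["samples"]) / (df["bags"] + df["samples"])
ratio_vats = ms["vats"] / pooled_ms                # tests sigma_1^2

var4 = ms["analyses"]
var3 = (pooled_ms - var4) / n4
var2 = 0.0                                         # taken as non-existent
var1 = (ms["vats"] - pooled_ms) / (n2 * n3 * n4)
total_variance = var1 + var2 + var3 + var4

print(f"ratios: samples {ratio_samples:.2f}, bags {ratio_bags:.2f}, vats {ratio_vats:.2f}")
print(f"components: {var1:.2f}, {var2:.2f}, {var3:.2f}, {var4:.2f}")
```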

(m)  An  Incomplete  Five  Factor  Analysis 

When we come to five factor analyses the number of different types of incomplete
analysis is considerable. We will discuss only one type(4), but from such a
viewpoint that it is hoped that the general method of treatment will be clear.

The table below gives the data of an experiment on a process using a cellulose
derivative. The final product was subject to fluctuations in a certain property,
and it was required to find out at what stages the variability was arising. The
cellulose derivative reaches the process in large homogeneous blends, and from
these a number of batches are mixed. Normally the whole of each mix is put
into a single truck and then dried, after which it is possible to observe the quality
of the product. Under this procedure it is impossible to disentangle the effects
of mixing from those of drying : to do this requires that a truck should contain
parts of more than one mix (in order to compare mixes independently of their
drying) and that a mix should be dried in more than one truck (in order to compare
dryings independently of the material of the mix). For practical reasons it was
impossible to split a mix into more than two portions. Accordingly a mix was
made and split into two portions, one portion going into one truck and the other
into another truck. The next mix was also split and the two portions filled up
the previous two trucks. A second pair of mixes taken from the same blend of
the cellulose derivative were similarly tested. The whole was repeated on three
subsequent blends. On every individual (that is, material from a particular truck
and mix from a particular pair from a particular blend) a duplicate determination
of quality was made.

(4) A complicated example dealing with glass manufacture is described by C. E. Gould
and W. M. Hampton : Supplement to Journal of the Royal Statistical Society, Vol. III,
page 137, 1936. See particularly E. S. Pearson's contribution to the discussion, page 156.
This example is also discussed by Tippett, "Methods of Statistics," Section 10.31, and
by Kendall, "The Advanced Theory of Statistics," Vol. II, page 202, where he treats it
as a complete analysis.


Blend    Pair     Mix        First Truck            Second Truck           Mix      Pair     Blend
Numbers  Numbers  Numbers    Duplicates   Total     Duplicates   Total     Totals   Totals   Totals

1        1        1          11  16        27       17  16        33        60
                  2           0   0         0       14  13        27        27
                  Truck totals             27                     60                 87
         2        3          12  12        24       10   8        18        42
                  4           6   6        12        1   3         4        16
                  Truck totals             36                     22                 58      145

2        3        5           9   9        18       19  18        37        55
                  6           3   1         4        6   6        12        16
                  Truck totals             22                     49                 71
         4        7           8   9        17        ?   ?        27        44
                  8           ?   ?        14        ?   ?        13        27
                  Truck totals             31                     40                 71      142

3        5        9          15  17        32       22  21        43        75
                  10         21  20        41       13  13        26        67
                  Truck totals             73                     69                142
         6        11         18  18        36       19  21        40        76
                  12          6   6        12        6   4        10        22
                  Truck totals             48                     50                 98      240

4        7        13         11  10        21        3   2         5        26
                  14          5   6        11        1   0         1        12
                  Truck totals             32                      6                 38
         8        15          8   8        16        6   6        12        28
                  16          4   4         8        7   8        15        23
                  Truck totals             24                     27                 51       89

In the table the first two figures 11 and 16 are the duplicate determinations
of quality on Mix 1 Pair 1 Blend 1, first truck. The 27 is the total of 11 and 16.
The next pair of figures 17 and 16 are the determinations on the second truck of
mix 1, and the 33 following is their total. The last figure in the row is the mix
total. The second row is for the second mix and is arranged in precisely the same
way. The third row has 27 as the total for the first truck, 60 as the total for the
second truck, and 87 as the pair total. The same follows for the second pair for
blend 1, with the addition of 145 as the blend total.

Prima  facie  this  is  a  five  factor  analysis,  the  factors  being  Blends  (B),  Pairs  (P), 
Mixes  (M),  Trucks  (T)  and  Determination  of  Quality  (A).  It  is  immediately 
apparent  that  it  is  not  a  complete  five  factor  analysis,  however,  as  A  is  obviously 
a  simple  replication  :  there  is  no  distinction  between  all  the  first  determinations 
as  compared  with  the  second.  Further,  there  is  nothing  special  to  distinguish 
the  first  pairs  from  the  second  pairs,  so  the  pair  main  effect  P  will  not  exist  and 
we  will  have  a  Between  Pairs  within  Blends  effect  formed  by  pooling  the  sums 
of  squares  and  degrees  of  freedom  of  the  P  and  B  X  P  terms. 

We will proceed to calculate this term and the B main effect. The correcting
factor, the square of the grand total divided by the grand number of individuals, is

$616^2/64 = 5929$

The Blend main effect is of course given by the blend totals

$(145^2 + 142^2 + 240^2 + 89^2)/16 - 5929 = 740.37$

The Between Pairs within Blends term is given directly by squaring the pair
totals and dividing by the number of individuals therein, and subtracting the
correcting factor and the Blend sum of squares, i.e.

$(87^2 + 58^2 + \ldots + 51^2)/8 - 5929 - 740.37 = 184.13$

We would of course get the same result if we formed a B x P table, calculated
the B, P, and B x P (residual) terms and then pooled P with B x P.

We now come to consider the M effect. There can be no M main effect,
as the first mixes in all pairs are not systematically distinguished from the second
mixes : we therefore have that $\sigma_m^2 = 0$. Further the M effect is only of interest
within a given pair, which is within a given blend : this term is $\sigma_{mbp}^2$, and $\sigma_{mb}^2$
and $\sigma_{mp}^2$ are both zero. If we strike out $\sigma_m^2$, $\sigma_{mb}^2$ and $\sigma_{mp}^2$ from the components
of variance of the M, M x B, and M x P mean squares, we find we have in each
case left as the components of variance

$\sigma_a^2 + a\sigma_{mtbp}^2 + at\sigma_{mbp}^2$

which is the same as for the M x B x P mean square. We therefore pool these
four terms, M, M x B, M x P, and M x P x B, and call the resulting term
"Between Mixes within Pairs and Blends." The pooled degrees of freedom are

$(m-1) + (m-1)(b-1) + (m-1)(p-1) + (m-1)(p-1)(b-1) = bp(m-1)$.

It is clear that this is a reasonable answer : looking at it in a rather different way,
each pair of mixes contributes (m-1) degrees of freedom, and there are bp pairs,
so the total is bp(m-1).

To calculate this pooled M x B x P term we need an M x B x P table, i.e.
a table formed by totalling over A and T. Such a table is already formed in the
column headed "Mix Totals," 60, 27, 42, 16, 55, etc. The total sum of squares
for this table is

$(60^2 + 27^2 + 42^2 + \ldots + 23^2)/4 - 5929 = 1771.5$

To obtain our pooled M x B x P term we could calculate in full M, M x B,
M x P, M x B x P and then pool them. We may note, however, that the
Total of an M x B x P table is made up of these terms plus B, P, and B x P.
These latter we already have, so the pooled M x B x P term is very easily obtained
as

$1771.5 - 740.37 - 184.13 = 847.00$

Its degrees of freedom are

$bp(m-1) = 4 \times 2(2-1) = 8$

The T term is obtained in exactly the same way as the M term. We require
the term formed by pooling T, T x B, T x P, and T x B x P, so we work from
the T x B x P table formed by totalling over A and M. The total sum of
squares for this table is

$(27^2 + 60^2 + 36^2 + 22^2 + 22^2 + \ldots + 27^2)/4 - 5929.0 = 1274.5$

From this we subtract the sums of squares of B, P, and B x P as before, and
we obtain as the sum of squares for the B x P x T term

$1274.5 - 740.37 - 184.13 = 350.00$

The degrees of freedom are

$bp(t-1) = 4 \times 2(2-1) = 8$.

We now want the M x T interaction. As for M and T, this only exists
within Pairs and Blends, so we pool M x T with M x T x B, M x T x P
and M x T x B x P. We therefore work from the M x T x B x P table,
i.e. the table formed by totalling over A. The total sum of squares for this table is

$(27^2 + 33^2 + 0^2 + 27^2 + 24^2 + \ldots + 15^2)/2 - 5929.0 = 2316.00$

To obtain the sum of squares for our pooled M x T x B x P term we can
subtract from this total all the other terms arising in an M x T x B x P table.
These are

(a) B, P and B x P : these we have as 740.37 + 184.13.

(b) M, M x B, M x P and M x B x P : this we have calculated as 847.00.

(c) T, T x B, T x P, and T x B x P : this we have calculated as 350.00.

The sum of squares is thus

$2316.00 - 740.37 - 184.13 - 847.00 - 350.00 = 194.50$

Its degrees of freedom are

$(m-1)(t-1) + (m-1)(t-1)(b-1) + (m-1)(t-1)(p-1) + (m-1)(t-1)(p-1)(b-1)$
$= bp(m-1)(t-1)$
$= 4 \times 2(2-1)(2-1)$
$= 8$.

The final term is the Between Analyses which is

$(11^2 + 16^2 + 17^2 + 16^2 + 0^2 + \ldots + 8^2) - (27^2 + 33^2 + 0^2 + \ldots + 15^2)/2 = 8278 - 8245 = 33.00$

The total sum of squares is the usual

$(11^2 + 16^2 + \ldots + 8^2) - 5929 = 2349.00$

We can now enter all these terms into an analysis of variance table.


Source of Variance             Mean Squares of Complete           Sums of     Degrees of   Mean      Components of Variance
                               Analysis being Pooled              Squares     Freedom      Squares

Between Blends                 B                                   740.37        3         246.79    $\sigma_a^2 + a\sigma_{mtbp}^2 + am\sigma_{tbp}^2 + at\sigma_{mbp}^2 + amt\sigma_{bp}^2 + amtp\sigma_b^2$
Between Pairs                  P, P x B                            184.13        4          46.03    $\sigma_a^2 + a\sigma_{mtbp}^2 + am\sigma_{tbp}^2 + at\sigma_{mbp}^2 + amt\sigma_{bp}^2$
  within Blends
Between Trucks within          T, T x B, T x P,                    350.00        8          43.75    $\sigma_a^2 + a\sigma_{mtbp}^2 + am\sigma_{tbp}^2$
  Pairs and Blends             T x B x P
Between Mixes within           M, M x B, M x P,                    847.00        8         105.87    $\sigma_a^2 + a\sigma_{mtbp}^2 + at\sigma_{mbp}^2$
  Pairs and Blends             M x B x P
Mix x Truck Interaction        M x T, M x T x B, M x T x P,        194.50        8          24.31    $\sigma_a^2 + a\sigma_{mtbp}^2$
  within Pairs and Blends      M x T x B x P
Between Analyses                                                    33.00       32           1.03    $\sigma_a^2$

Total                                                             2349.00       63
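
The sums of squares entered in this table can be checked from the truck-by-mix totals alone (each being the sum of two duplicate determinations). The sketch below is an illustration, not the author's working: the nested-list layout `cells[blend][pair][mix][truck]` and the helper `ss_of` are assumptions made for the example, and the figure 8278 for the sum of squares of the individual determinations is taken as printed in the text.

```python
# cells[blend][pair][mix][truck] = total of the two duplicate determinations
cells = [
    [[[27, 33], [0, 27]], [[24, 18], [12, 4]]],
    [[[18, 37], [4, 12]], [[17, 27], [14, 13]]],
    [[[32, 43], [41, 26]], [[36, 40], [12, 10]]],
    [[[21, 5], [11, 1]], [[16, 12], [8, 15]]],
]
N_OBS = 64                                             # 4 x 2 x 2 x 2 x 2 individuals

cell_tot = [c for b in cells for p in b for m in p for c in m]
mix_tot = [sum(m) for b in cells for p in b for m in p]
truck_tot = [p[0][t] + p[1][t] for b in cells for p in b for t in range(2)]
pair_tot = [sum(c for m in p for c in m) for b in cells for p in b]
blend_tot = [sum(c for p in b for m in p for c in m) for b in cells]

cf = sum(cell_tot) ** 2 / N_OBS                        # correcting factor, 5929


def ss_of(totals):
    """Squared totals over the individuals per total, less the correcting factor."""
    return sum(t * t for t in totals) / (N_OBS / len(totals)) - cf


ss_blends = ss_of(blend_tot)
ss_pairs = ss_of(pair_tot) - ss_blends                 # P pooled with B x P
ss_mixes = ss_of(mix_tot) - ss_blends - ss_pairs       # M within pairs and blends
ss_trucks = ss_of(truck_tot) - ss_blends - ss_pairs    # T within pairs and blends
ss_mix_truck = ss_of(cell_tot) - ss_blends - ss_pairs - ss_mixes - ss_trucks

SUM_SQ_INDIVIDUALS = 8278                              # as printed in the text
ss_analyses = SUM_SQ_INDIVIDUALS - sum(c * c for c in cell_tot) / 2
ss_total = SUM_SQ_INDIVIDUALS - cf

print(ss_blends, ss_pairs, ss_trucks, ss_mixes, ss_mix_truck, ss_analyses, ss_total)
```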

The appropriate tests of significance can be worked out from the components
of variance. It may be noted, however, that if both the Between Mixes and
Between Trucks terms are significant when tested against the Mix x Truck
interaction, i.e. if both $\sigma_{mbp}^2$ and $\sigma_{tbp}^2$ are greater than zero, then there is no
valid test for the Between Pairs Mean Square. This circumstance is similar to
that arising in Table 21 (Chapter XI (d)).

The Between Trucks Mean Square is not significant, but it is probably worth
while calculating its component of variance. We obtain

$\sigma_a^2 = 1.03$
$\sigma_{mtbp}^2 = 11.64$
$\sigma_{mbp}^2 = 15.40$
$\sigma_{tbp}^2 = 4.86$
$\sigma_{bp}^2 = 0.00$
$\sigma_b^2 = 10.05$
It  is  clear  that  the  origins  of  variability  are  rather  widespread,  Mixing  and 
its  interaction  with  Trucks  (drying)  and  Blends  being  responsible  for  the  greater 
part  of  the  variability. 
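
The component estimates can be retraced from the mean squares as sketched below. One step here is an inference rather than something stated in the text: the printed values 15.40 and 10.05 appear to be reproduced only if the Between Pairs term (whose mean square is smaller than that for Between Mixes, so that its own component is taken as zero) is pooled with the Between Mixes term. The variable names are illustrative.

```python
a, m, t, p = 2, 2, 2, 2        # analyses, mixes per pair, trucks, pairs per blend

ms_analyses = 33.00 / 32
ms_mix_truck = 194.50 / 8
ms_trucks = 350.00 / 8
ms_blends = 740.37 / 3

var_a = ms_analyses
var_mtbp = (ms_mix_truck - var_a) / a
var_tbp = (ms_trucks - ms_mix_truck) / (a * m)

# Assumed step: pool Between Pairs (184.13, 4 d.f.) with Between Mixes (847.00, 8 d.f.).
pooled_mixes_ms = (184.13 + 847.00) / (4 + 8)
var_mbp = (pooled_mixes_ms - ms_mix_truck) / (a * t)
var_bp = 0.0
var_b = (ms_blends - pooled_mixes_ms) / (a * m * t * p)

for name, value in [("a", var_a), ("mtbp", var_mtbp), ("mbp", var_mbp),
                    ("tbp", var_tbp), ("bp", var_bp), ("b", var_b)]:
    print(f"sigma^2_{name:5s} = {value:6.2f}")
```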
CHAPTER XII
MISCELLANEOUS ASPECTS OF THE ANALYSIS OF VARIANCE

(a)  Introduction 

In  this  Chapter  a  number  of  miscellaneous  aspects  of  the  use  of  the  analysis 
of  variance  will  be  discussed.  The  next  five  sections  contain  matter  which  might 
be  borne  in  mind  in  planning  and  analysing  the  results  of  factorial  experiments, 
Section  (g)  is  an  account  of  a  useful  experimental  device  (the  Latin  Square),  and 
the  subsequent  sections  describe  various  applications  of  the  analysis  of  variance. 

(b) The Use of Components of Variance (1)

In  the  systematic  discussion  of  the  Analysis  of  Variance  in  Chapter  XI  for 
each  type  of  analysis  the  components  of  variance  corresponding  to  each  mean 
square  were  given. 

It should be realized that the mean square only estimates these terms, and
one must not regard them as precise algebraic identities. Thus in the complete
three factor analysis (Table 22) we had
31.08 estimates $w\sigma_{ta}^2 + \sigma_0^2$
161.08 estimates $\sigma_0^2$

It is evident that $\sigma_{ta}^2$ does not exist, i.e. is zero, and so we have two estimates of
$\sigma_0^2$ which we can pool in the usual way, taking into account the degrees of freedom
(i.e. the simplest way is to pool the sums of squares and the degrees of freedom
and calculate the new mean square, estimating $\sigma_0^2$ as the former divided by the
latter).
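
As a small illustration of pooling "in the usual way", the sketch below pools two estimates by adding their sums of squares and their degrees of freedom. The function name `pool` is introduced here for the example, and the figures used are the Between Bags and Between Samples terms from the vat example of Chapter XI (l) rather than those of Table 22, whose degrees of freedom are not quoted in this section.

```python
def pool(estimates):
    """Pool estimates given as (sum_of_squares, degrees_of_freedom) pairs.

    Returns the pooled mean square and its degrees of freedom.
    """
    total_ss = sum(ss for ss, _ in estimates)
    total_df = sum(df for _, df in estimates)
    return total_ss / total_df, total_df


# Between Bags (36.25 on 6 d.f.) pooled with Between Samples (45.50 on 12 d.f.).
pooled_ms, pooled_df = pool([(36.25, 6), (45.50, 12)])
print(round(pooled_ms, 2), pooled_df)
```

Pooling the sums of squares in this way weights each mean square by its degrees of freedom, which a plain average of the two mean squares would not do.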

This pooling is of course only valid when the estimates being pooled are
consistent as tested with the variance ratio test. If $(w\sigma_{ta}^2 + \sigma_0^2)$ was significantly
greater than $\sigma_0^2$ then of course we would not pool. It is possible that
$(w\sigma_{ta}^2 + \sigma_0^2)$ may be significantly smaller than $\sigma_0^2$. This is very unlikely to happen
and if it does then the meaning is rather obscure. Either it is an accident (the
1 in 20 chance which will occur 1 in 20 times) and we can ignore it, or some human
agency is entering in a certain way to bias the results. (2)

The Components of Variance are useful for indicating the sequence of tests
in testing the various terms for significance. They are also useful for estimating
the actual amount of variability each part of the process is giving rise to,
in  order  to  decide  on  which  part  of  the  process  the  control  must  be  improved  to 
increase  the  uniformity  of  the  product.  This  application  was  discussed  in  Chapter 
VII  (Section  (c))  and  Chapter  XI  (Section  (m)). 

The coefficients of the components of variance given in the various tables
in Chapter XI are strictly only valid when the supposed populations are infinite.
If the population is made up of batches of raw material, we can comfortably
suppose that there is an infinite population of batches even if we are only
concerned with a few dozen or a few hundred. If the populations are factories, then
there may be only two or three factories, and it is unlikely that any others will be
built in the future. Under these circumstances it becomes a little difficult to regard the
population as infinite, and strictly the coefficients of the components of variance
become modified. (3)

For  practical  purposes  the  modifications  can  be  generally  ignored,  because 

(a)  the  sequence  of  tests  of  significance  is  unaltered,  and  therefore  no  mistakes 
will  be  made  by  using  the  unmodified  coefficients, 

(b) the actual estimates of the component of variance using the simple
coefficients should be multiplied by the factor $(k - 1)/k$, where k is the size of
the population. It is only for k = 2 or 3 that the factor is appreciable, being
0.5 and 0.66 respectively. For larger values of k the difference of the factor
from unity is rarely important compared with the fairly large standard error that
is of course attached to these components of variance.

(1) On this point see S. L. Crump: Biometrics Bulletin, Vol. 2, No. 1, page 7, 1946.

(2) An interesting example of this is given by J. O. Irwin, Journal of the Royal Statistical
Society, Vol. CIX, p. 157, 1946.

(3) cf. H. E. Daniels, Supp. Journ. Roy. Stat. Soc. VI No. 2, 1939, p. 188.
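As a rough illustration of how quickly this correction dies away, the factor $(k - 1)/k$ can be tabulated for a few population sizes. The sketch below is in Python; the function name `finite_population_factor` is merely a label chosen here and does not come from the text.

```python
def finite_population_factor(k):
    """Factor (k - 1)/k applied to a component of variance for a population of size k."""
    if k < 2:
        raise ValueError("the population must contain at least two members")
    return (k - 1) / k

# Tabulate the factor for a few population sizes.
factors = {k: finite_population_factor(k) for k in (2, 3, 5, 10, 50)}
for k, factor in factors.items():
    print(f"k = {k:2d}   factor = {factor:.3f}")
```

For k = 2 the factor is one half and for k = 3 it is two thirds; by k = 10 it is already 0.9, which is small beside the standard error normally attached to a component of variance.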

In  the  case  where  a  factor  is  made  up  of  arbitrarily  chosen  different  levels 
of  a  continuous  variable,  the  population  can  of  course  be  reasonably  regarded  as 
infinite,  but  we  are  probably  more  interested  in  the  actual  mean  values  and  it 
will  probably  be  of  no  interest  to  calculate  the  component  of  variance. 

Even in the simplest cases the expressions for the standard errors of the
components of variance are complicated, and care must be taken in comparing
two such components. In many of the analyses of the previous chapter particular
terms had only single degrees of freedom. Under these circumstances the errors
of the components would be very large. The fact that the expressions for the
components of variance were given should not therefore be taken as an indication
that it would be good policy to calculate them in these instances: they were given
to illustrate the general case where there may be quite a number of degrees of
freedom and also to show the logic of the sequence of the tests of significance.


(c) Partitioning a Sum of Squares into Linear, Quadratic, etc. Components

Frequently  we  have  a  factor  at  3,  4,  5  or  more  levels,  where  the  factor  has  a 
numerical  nature,  the  levels  being  chosen  at  intervals  on  the  range  of  the  continuous 
variable,  e.g.  temperature.  We  may  wish  to  know  whether  the  relation  between 
the  independent  variable  and  the  dependent  variable  can  be  regarded  as  linear, 
or  whether  there  is  appreciable  curvature.  If  the  steps  between  successive  levels 
have  been  made  equal,  e.g.  if  the  temperatures  are  20,  25,  30,  35°C,  then  the 
question  can  be  answered  with  the  greatest  of  ease. 

Consider  the  data  in  Table  12.1.  The  four  values  of  T  represent  four 
temperatures,  increasing  in  equal  steps,  of  nitration  of  cellulose  in  mixed  acid  of 
four  different  water  contents,  represented  by  W,  this  also  varying  in  equal  steps. 
The  dependent  variable  whose  values  are  given  in  the  table  is  the  viscosity  of  the 
resulting  nitro-cellulose. 

Table 12.1

            T1      T2      T3      T4    Totals
W1           6      32      45      63      146
W2          24      27      45      62      158
W3           6      24      44      45      119
W4           1      22      23      39       85
Totals      37     105     157     209      508

A  straightforward  two  factor  analysis  of  variance  gives  the  results  in  Table  12.2. 

Table 12.2

Source of Variance    Degrees of Freedom    Sums of Squares    Mean Squares
Temperature                    3                 4052.0           1350.67
Water Content                  3                  787.5            262.50
Residual                       9                  367.5             40.83
Total                         15                 5207.0
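The figures of Table 12.2 can be checked mechanically. The following is a minimal sketch in Python of the ordinary row and column analysis of variance applied to the data of Table 12.1; the function name `two_factor_anova` and the dictionary layout are assumptions of this sketch, not anything prescribed in the text.

```python
# Viscosity data of Table 12.1: rows are W1..W4, columns are T1..T4.
data = [
    [6, 32, 45, 63],
    [24, 27, 45, 62],
    [6, 24, 44, 45],
    [1, 22, 23, 39],
]

def two_factor_anova(table):
    """Row and column analysis of variance for an unreplicated two-way table."""
    n_rows, n_cols = len(table), len(table[0])
    grand_total = sum(sum(row) for row in table)
    correction = grand_total ** 2 / (n_rows * n_cols)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    ss_total = sum(x * x for row in table for x in row) - correction
    ss_rows = sum(t * t for t in row_totals) / n_cols - correction
    ss_cols = sum(t * t for t in col_totals) / n_rows - correction
    ss_resid = ss_total - ss_rows - ss_cols  # residual by difference
    df_rows, df_cols = n_rows - 1, n_cols - 1
    df_resid = df_rows * df_cols
    # Each entry is (degrees of freedom, sum of squares, mean square).
    return {
        "rows": (df_rows, ss_rows, ss_rows / df_rows),
        "columns": (df_cols, ss_cols, ss_cols / df_cols),
        "residual": (df_resid, ss_resid, ss_resid / df_resid),
        "total": (n_rows * n_cols - 1, ss_total, None),
    }

anova_12_2 = two_factor_anova(data)
for source, (df, ss, ms) in anova_12_2.items():
    print(source, df, round(ss, 2), None if ms is None else round(ms, 2))
```

Here "columns" corresponds to Temperature and "rows" to Water Content.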
Both the effects are obviously highly significant. We can proceed to partition
each of the two sums of squares into three components, one degree of freedom
for the linear term, one for the quadratic term, and one for the cubic term.
For the linear term, the sum of squares is

$$(3T_4 + T_3 - T_2 - 3T_1)^2 / (N \sum k^2)$$

The T's represent the totals of N observations, and $\sum k^2$ is the sum of the
squares of the coefficients of the numerator. Here

$$\sum k^2 = 3^2 + 1^2 + (-1)^2 + (-3)^2 = 20$$

Accordingly the linear component of the sum of squares is

$$(3 \times 209 + 157 - 105 - 3 \times 37)^2 / (4 \times 20) = 4032.80$$

The quadratic component of the sum of squares is

$$(T_4 - T_3 - T_2 + T_1)^2 / (N \sum k^2) = (209 - 157 - 105 + 37)^2 / (4 \times 4) = 16.00$$

The cubic component is

$$(T_4 - 3T_3 + 3T_2 - T_1)^2 / (N \sum k^2) = (209 - 3 \times 157 + 3 \times 105 - 37)^2 / (4 \times 20) = 3.20$$

A ready check on the correctness of the arithmetic is obtained by noticing
that the sum of the three components equals the Temperature sum of squares
(4052.0) given in Table 12.2.

The  Sum  of  Squares  for  Water  Content  can  be  partitioned  exactly  similarly 
and  we  get  the  results  in  Table  12.3. 

Table 12.3

Source of Variance             Degrees of Freedom    Sums of Squares    Mean Squares
Temperature:    Linear                  1                4032.80           4032.80
                Quadratic               1                  16.00             16.00
                Cubic                   1                   3.20              3.20
Water Content:  Linear                  1                 616.05            616.05
                Quadratic               1                 132.25            132.25
                Cubic                   1                  39.20             39.20
Residual                                9                 367.50             40.83
Total                                  15                5207.00
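The partition can be sketched in a few lines of Python. The helper name `contrast_ss` is an assumption of this sketch; the coefficients are those used in the text for four equally spaced levels, written in the order T1 to T4.

```python
def contrast_ss(totals, coeffs, n_per_total):
    """Sum of squares for one component: (sum of k*T)^2 / (N * sum of k^2)."""
    value = sum(k * t for k, t in zip(coeffs, totals))
    return value ** 2 / (n_per_total * sum(k * k for k in coeffs))

# Coefficients for four equally spaced levels, applied to levels 1..4 in order.
FOUR_LEVELS = {
    "linear": (-3, -1, 1, 3),
    "quadratic": (1, -1, -1, 1),
    "cubic": (-1, 3, -3, 1),
}

temperature_totals = [37, 105, 157, 209]   # T1..T4, each the total of 4 observations
water_totals = [146, 158, 119, 85]         # W1..W4, each the total of 4 observations

temperature_parts = {name: contrast_ss(temperature_totals, k, 4)
                     for name, k in FOUR_LEVELS.items()}
water_parts = {name: contrast_ss(water_totals, k, 4)
               for name, k in FOUR_LEVELS.items()}

for label, parts in (("Temperature", temperature_parts), ("Water Content", water_parts)):
    for name, ss in parts.items():
        print(f"{label:14s} {name:10s} {ss:9.2f}")
    print(f"{label:14s} {'sum':10s} {sum(parts.values()):9.2f}")
```

The three Temperature components add up to 4052.0 and the three Water Content components to 787.5, the sums of squares of the unpartitioned analysis, which is the check on the arithmetic mentioned above.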

The linear components in each case are obviously significant, and the
significance of the quadratic and cubic terms can be tested in the usual manner against
the Residual. It is interesting to note that the apparent bend over from W2 to
W1 for decreasing W is not significant.

In the case where there are only three levels, there are 2 degrees of freedom,
one for the linear component and one for the quadratic component. These sums
of squares are respectively

$$(T_1 - T_3)^2 / (N \sum k^2), \qquad \sum k^2 = 2$$

$$(2T_2 - T_1 - T_3)^2 / (N \sum k^2), \qquad \sum k^2 = 6$$

For more than four levels, Fisher and Yates, "Statistical Tables for Biological,
Agricultural and Medical Research" (Oliver and Boyd), give the appropriate factors
up to 52 levels.

In the case where two factors are numerical variables changing in equal
steps, their interaction as well as the main effects can be split up. Consider the
3 x 3 table below (Table 12.4).

Table 12.4

            X1      X2      X3    Totals
Y1          17      20      24      61
Y2          14      19      25      58
Y3          10      18      30      58
Totals      41      57      79     177

We can use the expressions

$$(T_1 - T_3)^2 / (N \sum k^2)$$

and

$$(-T_1 + 2T_2 - T_3)^2 / (N \sum k^2)$$

to partition the two main effects into their linear and quadratic components, which
we can represent as Xl, Xq, Yl and Yq respectively. We can also partition
the 4 degrees of freedom for the interaction into that attributable to Xl Yl,
Xq Yl, Xl Yq, and Xq Yq. To do this we set up a table of the type shown
in Table 12.5.

Table 12.5

      Xl     Xq              Yl     Yq
       1     -1               1     -1
       0      2               0      2
      -1     -1              -1     -1

To form a table of coefficients for obtaining Xl Yl, we take the coefficients
for Xl and multiply them by the first coefficient for Yl to give the first row,
by the second coefficient for Yl to give the second row, and similarly for the
third row. For Xq Yl we multiply the coefficients for Xq by those for Yl in a
similar manner.

Proceeding  thus,  we  obtain  the  table  of  coefficients  in  Table  12.6. 

Table 12.6

      Xl Yl               Xq Yl               Xl Yq               Xq Yq
   1    0   -1        -1    2   -1        -1    0    1         1   -2    1
   0    0    0         0    0    0         2    0   -2        -2    4   -2
  -1    0    1         1   -2    1        -1    0    1         1   -2    1

 $\sum k^2 = 4$      $\sum k^2 = 12$     $\sum k^2 = 12$     $\sum k^2 = 36$


The $\sum k^2$ quoted below each table is the sum of the squares of the coefficients.
To obtain the sum of squares due to e.g. Xq Yl, we operate on the figures in Table
12.4 by the coefficients for Xq Yl in Table 12.6, square this result, and divide
by $\sum k^2$ times the number of observations in each figure that has been multiplied
by the coefficients. In the present instance the latter figure is of course 1, but
if the figures in Table 12.4 were the totals of N observations then it would be N.
Accordingly we have as the Sum of Squares for Xq Yl

$$(-1 \times 17 + 2 \times 20 - 1 \times 24 + 1 \times 10 - 2 \times 18 + 1 \times 30)^2 / 12 = 0.750$$

We  can  now  assemble  the  complete  analysis  of  variance  as  in  Table  12.7. 
The  column  "Sums  of  Squares"  is  obtained  by  squaring  the  figures  in  column  two 
and  dividing  by  column  three.  A  good  check  on  the  arithmetic  is  given  by  the 
agreement  of  the  sum  of  all  the  components  with  the  sum  calculated  directly  from 
the  original  observations. 

Table 12.7

Source of      Products of Coefficients      $N \sum k^2$      Sums of
Variance       and Data                                        Squares
Xl                    -38                      3 x 2           240.667
Xq                     -6                      3 x 6             2.000
Yl                      3                      3 x 2             1.500
Yq                     -3                      3 x 6             0.500
Xl Yl                  13                      1 x 4            42.250
Xq Yl                   3                      1 x 12            0.750
Xl Yq                   5                      1 x 12            2.083
Xq Yq                   3                      1 x 36            0.250
Total                                                          290.000
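The whole of this analysis can be sketched compactly in Python by forming the cell coefficients as products of a row (Y) and a column (X) set of coefficients. The helper `component` and the all-ones set `UNIT` (which simply sums over a factor, so that main effects are handled by the same routine) are assumptions of this sketch rather than devices described in the text.

```python
# Observations of Table 12.4: rows are Y1..Y3, columns are X1..X3.
table_12_4 = [
    [17, 20, 24],
    [14, 19, 25],
    [10, 18, 30],
]

LINEAR = (1, 0, -1)       # linear coefficients for three equally spaced levels
QUADRATIC = (-1, 2, -1)   # quadratic coefficients for three levels
UNIT = (1, 1, 1)          # helper of this sketch: sums over the other factor

def component(table, x_coeffs, y_coeffs):
    """Return (product of coefficients and data, divisor, sum of squares)."""
    # Each cell coefficient is the product of its row (Y) and column (X) coefficients.
    weights = [[ky * kx for kx in x_coeffs] for ky in y_coeffs]
    product = sum(w * v for w_row, v_row in zip(weights, table)
                  for w, v in zip(w_row, v_row))
    # Sum of squared cell coefficients; for a main effect this equals N * sum(k^2).
    divisor = sum(w * w for w_row in weights for w in w_row)
    return product, divisor, product ** 2 / divisor

components = {
    "Xl": component(table_12_4, LINEAR, UNIT),
    "Xq": component(table_12_4, QUADRATIC, UNIT),
    "Yl": component(table_12_4, UNIT, LINEAR),
    "Yq": component(table_12_4, UNIT, QUADRATIC),
    "Xl Yl": component(table_12_4, LINEAR, LINEAR),
    "Xq Yl": component(table_12_4, QUADRATIC, LINEAR),
    "Xl Yq": component(table_12_4, LINEAR, QUADRATIC),
    "Xq Yq": component(table_12_4, QUADRATIC, QUADRATIC),
}
total_ss = sum(ss for _, _, ss in components.values())

for name, (product, divisor, ss) in components.items():
    print(f"{name:6s} {product:4d} {divisor:3d} {ss:8.3f}")
print(f"Total {total_ss:17.3f}")
```

The eight components together account for the whole of the 8 degrees of freedom, so their sum agrees with the total sum of squares calculated directly from the nine observations.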

If  we  had  not  made  this  detailed  analysis,  but  merely  an  analysis  by  rows 
and  columns,  we  would  have  obtained  Table  12.8. 

Table 12.8

Source of Variance    Degrees of Freedom    Sums of Squares    Mean Squares
X                              2                242.667           121.333
Y                              2                  2.000             1.000
Residual                       4                 45.333            11.333
Total                          8                290.000

This  analysis,  though  better  than  nothing,  is  not  nearly  so  informative  as 
that  of  Table  12.7.  Table  12.8  is  completely  unable  to  give  any  information  on 
the  possible  X  Y  interaction.  With  Table  12.7  we  of  course  have  to  pool  the 
smaller  terms  Xq,  Yq,  Xl  Yq,  Xq  Yl,  and  against  this  Xl  Yl  is  found  to  be 
significant. 
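The pooling just described amounts to very little arithmetic, sketched below in Python. The sums of squares are those of Table 12.7; the 5% and 1% points of the variance ratio for 1 and 4 degrees of freedom (7.71 and 21.20) are taken from standard tables and are not quoted in the text.

```python
# Sums of squares from Table 12.7 for the terms to be pooled,
# each carrying one degree of freedom.
pooled_terms = {"Xq": 2.000, "Yq": 0.500, "Xl Yq": 2.083, "Xq Yl": 0.750}
ss_xlyl = 42.250   # Xl Yl, one degree of freedom

pooled_ss = sum(pooled_terms.values())
pooled_df = len(pooled_terms)
pooled_ms = pooled_ss / pooled_df
variance_ratio = ss_xlyl / pooled_ms   # degrees of freedom 1 and 4

# Tabulated points of the variance ratio for 1 and 4 degrees of freedom.
F_5_PERCENT, F_1_PERCENT = 7.71, 21.20

print(f"pooled mean square = {pooled_ms:.3f} on {pooled_df} degrees of freedom")
print(f"variance ratio for Xl Yl = {variance_ratio:.2f}")
print("exceeds 5% point:", variance_ratio > F_5_PERCENT)
print("exceeds 1% point:", variance_ratio > F_1_PERCENT)
```

On these figures the variance ratio for Xl Yl is well beyond the 1% point.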

This device is applicable generally for any number of levels. From a table
of the type of Table 12.5 we can form tables of the type of Table 12.6, and this
procedure will work for mixed levels, e.g. a 3 x 4 or a 3 x 5 or a 4 x 5. Further,
there can be other factors present, and this is taken account of by adjusting N in
the divisor $N \sum k^2$ to correspond to the number of original observations that have
been summed to give us the table, such as Table 12.4, on which we operate with the
coefficients.
In the case of four or more levels for each factor, the number of components
rises rapidly, e.g. for a 4 x 4 experiment the residual has 9 degrees of freedom
and hence 9 components, for a 5 x 5 experiment the corresponding number is 16.
The calculation of all of them is time-consuming and also really unnecessary.
Normally the only components that are likely to be of interest are Xl Yl, Xq Yl
and Xl Yq. Even the latter two are unlikely to be of interest if Xq and Yq
respectively have proved to be small. Accordingly it normally suffices to calculate
the total interaction in the usual manner and obtain a residual as the difference
between it and the sum of the three components mentioned. In this case, of
course, there is no explicit check on the arithmetic, so the steps taken need careful
checking.

It may be of interest to note that if this procedure is applied to the data of
Table 12.1 we obtain as the sum of squares for Tl Wl 102.01. Accordingly the
Residual has as its sum of squares 367.50 - 102.01 = 265.49 with 8 degrees of
freedom, the mean square being 33.186. The resulting variance ratio for Tl Wl
is then 3.07 with degrees of freedom $n_1 = 1$, $n_2 = 8$, and this does not reach
the 5% level of significance.
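This calculation can be sketched in Python as follows. The linear coefficients for four equally spaced levels are applied to both factors of Table 12.1; the 5% point of the variance ratio for 1 and 8 degrees of freedom (5.32) is taken from standard tables and is not quoted in the text.

```python
# Viscosity data of Table 12.1: rows are W1..W4, columns are T1..T4.
data = [
    [6, 32, 45, 63],
    [24, 27, 45, 62],
    [6, 24, 44, 45],
    [1, 22, 23, 39],
]
LINEAR_4 = (-3, -1, 1, 3)   # linear coefficients for four equally spaced levels

# Each cell coefficient is the product of its W and T linear coefficients.
product = sum(kw * kt * data[i][j]
              for i, kw in enumerate(LINEAR_4)
              for j, kt in enumerate(LINEAR_4))
divisor = sum(kw * kw * kt * kt for kw in LINEAR_4 for kt in LINEAR_4)  # 20 * 20
ss_tlwl = product ** 2 / divisor

# The Residual of Table 12.2 (9 degrees of freedom) contains Tl Wl.
residual_ss = 367.50 - ss_tlwl
residual_df = 9 - 1
residual_ms = residual_ss / residual_df
variance_ratio = ss_tlwl / residual_ms

F_5_PERCENT_1_8 = 5.32   # tabulated 5% point for 1 and 8 degrees of freedom

print(f"Tl Wl sum of squares = {ss_tlwl:.2f}")
print(f"Residual mean square = {residual_ms:.3f} on {residual_df} degrees of freedom")
print(f"variance ratio       = {variance_ratio:.2f}")
print("reaches 5% level:", variance_ratio >= F_5_PERCENT_1_8)
```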

It might be emphasised again that this whole procedure rests on the
assumption that each factor is increasing in equal steps, for example, 10°C, 15°C, 20°C,
25°C. However, if we prefer, we can make the steps equal on a logarithmic
scale, for example 1%, 2%, 4%, 8%. In this case, Xl will measure the fit to
linearity on this scale, and similarly Xq and the higher terms deviations from
linearity on this scale.

(d)  The  Assumption  Underlying  Factorial  Design 

A  factorial  experiment  is  much  more  useful,  in  that  its  interpretation  is 
simplest  and  that  its  efficiency  in  the  sense  of  giving  more  information  for  a  given 
number  of  experiments  than  the  corresponding  classical  design  is  greatest,  when 
the  interactions  are  not  significant. 

It may be interesting to note that the non-existence of interactions presupposes
that the dependent variable y can be expressed as the sum of a series of functions
of the independent variables $x_1$, $x_2$, etc., each of the latter involving one and only
one independent variable, i.e.

$$y = f(x_1) + f(x_2) + \dots \qquad (1)$$

The functions can be anything, of course, with no limit to the degree of complexity.

To take a simple example, let

$$y = x_1 + x_2^2 \qquad (2)$$

If the two independent variables were each at two levels, 1 and 2, we
would obtain the results in the table below for y:

              x2 = 1    x2 = 2
x1 = 1           2         5
x1 = 2           3         6

It will be readily seen that there is no $x_1 \times x_2$ interaction. Thus the effect
on y of changing $x_2$ for $x_1 = 1$ is 5 - 2 = 3, which is identical with that for
$x_1 = 2$, 6 - 3 = 3, i.e. the effect on y of changing $x_2$ is independent of the level
of $x_1$.

Now consider a relationship of the type

$$y = f(x_1) \, f(x_2) \qquad (3)$$

where the functions $f(x_1)$ and $f(x_2)$ can be of any form.
To take an example of the simplest form, let

$$y = x_1 x_2^2 \qquad (4)$$

With the two independent variables at two levels 1 and 2, we would obtain the
results as in the table below for y.

              x2 = 1    x2 = 2
x1 = 1           1         4
x1 = 2           2         8

It will be readily seen that there is an $x_1 \times x_2$ interaction. Thus the effect
on y of changing $x_2$ from 1 to 2 for $x_1 = 1$ is equal to 4 - 1 = 3 but for $x_1 = 2$
is equal to 8 - 2 = 6, i.e. the effect on y of changing $x_2$ is dependent on the level
of $x_1$.

It will be apparent that if we take logarithms throughout, equation (3) becomes

$$\log y = \log f(x_1) + \log f(x_2) \qquad (5)$$

and we now have an equation of the same form as (1), and the analysis will be
non-interacting.
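The point can be illustrated numerically. The Python sketch below assumes the two example relationships to be $y = x_1 + x_2^2$ and $y = x_1 x_2^2$, which are the forms that reproduce the two small tables of y given above; the function names are labels chosen for the sketch.

```python
import math

def additive(x1, x2):
    """y = x1 + x2**2, a sum of functions of single variables."""
    return x1 + x2 ** 2

def multiplicative(x1, x2):
    """y = x1 * x2**2, a product of functions of single variables."""
    return x1 * x2 ** 2

def log_multiplicative(x1, x2):
    """Logarithm of the multiplicative response."""
    return math.log(multiplicative(x1, x2))

def interaction(response, low=1, high=2):
    """Effect of x2 at the high level of x1 minus its effect at the low level."""
    effect_at_low = response(low, high) - response(low, low)
    effect_at_high = response(high, high) - response(high, low)
    return effect_at_high - effect_at_low

print("additive:          ", interaction(additive))
print("multiplicative:    ", interaction(multiplicative))
print("log multiplicative:", round(interaction(log_multiplicative), 12))
```

The additive form gives no interaction, the multiplicative form gives an interaction of 3 (the difference between 6 and 3), and after taking logarithms the interaction vanishes apart from rounding in the floating point arithmetic.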


(e)  The  Use  of  Interactions  as  Estimates  of  Error 

In  our  original  two  factor  (row  and  column)  analysis  what  was  termed  the 
Residual  included  not  only  variance  due  to  error  but  also  to  any  possible  interaction 
between  the  factors.  If  the  latter  exists  and  is  large,  then  we  have  a  large  Mean 
Square  for  the  Residual,  and  hence  neither  of  the  main  effects  is  likely  to  be 
significant,  i.e.  we  conclude  the  experiment  knowing  little  more  about  the  behaviour 
of  the  system  than  when  we  began. 

There  are  two  possible  courses  of  action  open  under  these  circumstances. 

We  can  replicate,  and  carry  out  an  analysis  as  described  in  the  section  on 
Two  Factor  Analysis  with  Replication,  which  would  give  us  an  estimate  of  error 
independent  of  the  Interaction. 

Alternatively,  we  could  repeat  the  series  of  observations,  but  with  a  third 
factor,  held  constant  at  a  certain  level  in  the  first  series,  held  constant  at  another 
level  in  the  second  series.  The  experiment  would  now  be  a  three  factor  one, 
and  we  would  be  relying  upon  the  second  order  interaction  (P  X  Q  X  R)  to  be 
small  or  non-existent  so  as  not  to  inflate  the  Residual. 

It  may  happen,  however,  that  this  second  order  interaction  is  large.  A  good 
example  of  this  was  in  the  Section  on  a  Four  Factor  Experiment,  when  the  four 
factor  experiment  was  broken  down  into  two  three  factor  experiments,  and  the 
S2  had  a  large  Residual  almost  certainly  because  it  contained  a  large  second  order 
interaction. 

Under  these  circumstances  there  are  two  possible  alternatives.  We  can  either 
replicate,  in  which  case  we  would  analyse  the  data  as  in  the  Section  on  Three 
Factor  Analysis  with  Replication. 

Alternatively,  we  could  add  a  fourth  factor,  and  use  the  third  order  interaction 
as  an  estimate  of  error.  It  may  happen  that  this  third  order  interaction  is  large, 
and  so  we  will  have  an  inflated  estimate  of  error,  and  still  not  be  much  better  off. 
However,  if  this  interaction  is  large,  it  is  valuable  to  know  this,  so  it  will  have 
been  as  well  to  include  this  additional  factor. 

A  further  example  is  in  the  Five  Factor  experiment  discussed  in  Chapter 
XI  (f),  where  the  W  X  T  X  D  interaction  was  very  large.  If  the  experiment 
had  been  carried  out  as  a  three  factor  one  on  W,  T,  and  D,  this  large  W  X  T  x  D 
interaction  would  have  been  used  as  the  residual,  and  no  conclusions  could  have 
been drawn. Actually, of course, the experiment included two further factors,
S  and  G,  and  fortunately  neither  of  these  interacted,  so  a  reasonably  small  residual 
was  obtained. 

We  can  thus  formulate  some  general  ideas  about  the  planning  of  factorial 
experiments.  We  have  first  to  decide  what  are  the  main  factors  in  which  we  are 
interested,  and  the  number  of  levels  at  which  we  will  test  them.  We  then  need 
to  decide  which  are  the  factors  which  normally  vary  in  the  process  and  are  likely 
to  interact  with  our  main  factors,  and  include  these,  possibly  only  at  two  levels. 
We now need to decide whether the combination of factors so far introduced will
give  us  a  satisfactory  residual  as  given  by  some  non-existent  high  order  interaction. 
To  make  this  decision  is  of  course  frequently  difficult.  With  a  certain  amount 
of  prior  information  as  to  the  general  behaviour  of  the  system  we  can  frequently 
have  some  confidence  in  our  conclusion,  but  often  it  is  quite  impossible  to  guess. 
We  can  only  go  on  the  general  rule  that  it  is  not  very  frequent  for  high  order 
interactions  to  exist,  or  if  they  do  exist,  to  be  very  large,  and  be  prepared  to  replicate 
the  whole  experiment  if  we  end  up  with  a  large  residual.  Alternatively,  if  it  is 
absolutely  essential  to  be  certain  of  getting  results  on  the  first  experiment,  then 
we  must  include  a  replication  in  the  first  experiment,  perhaps  at  the  cost  of  omitting 
a  factor. 

It  is  probably  generally  desirable  to  include  batch  of  raw  material  as  one  of 
the  factors,  for  it  may  be  that  all  batches  do  not  react  identically  to  particular 
treatments.  In  an  example  in  the  author's  experience,  a  crude  chemical  was 
subjected  to  a  purifying  process,  and  a  factorial  experiment  established  with  a 
greater  significance  than  1%  that  level  1  of  a  particular  factor  gave  a  much  better 
product  than  level  2.  This  experiment  was  only  carried  out  on  a  single  batch  of 
crude  material,  however,  and  subsequent  experience  showed  that  in  general  this 
conclusion  was  erroneous.  If  the  experiment  had  been  replicated  on  one  or  more 
batches  this  mistake  would  have  been  avoided.  If  we  can  afford  to  carry  out 
replications,  we  should  therefore  replicate  on  different  batches  rather  than  within 
a  single  batch,  this  really  being  to  introduce  an  additional  factor,  batch  of  raw 
material. 

(f) The Amount of Detail Required in Reports

In presenting in a paper or report the results of an investigation using a
factorial design with an analysis of variance upon the results, it is desirable to give

(a) the factors and the levels at which they were used,

(b) the analysis of variance. It is a useful practice to mark with asterisks
the significant terms, one (*) for the 5% level, two (**) for the 1% level, and
three (***) for the 0.1% level.

(c) If we are interested in components of variance then we should calculate
those which are significant. If we are interested in means then these should be
given. In the case where two factors interact, a two-way table is necessary,
e.g. Table 29.

(d) In general it is not necessary to give the original data, particularly when
it is extensive: the statistics specified above contain the essential facts.
However, when the original data is not very extensive, there is no harm in giving it.

(g)  The  Latin  Square 

A  statistical  device  that  was  originally  introduced  by  Fisher  to  overcome 
difficulties  arising  out  of  differences  in  soil  fertility  in  agricultural  experiments  is 
the  Latin  Square. 
Suppose we wished to investigate the effects of two factors, P and Q, each
at four levels, upon a process. P might be the pressure at which cordite was
pressed and Q might be the angle of the choke plate in the press. Then we
would carry out the experiment below: i.e. 16 experiments are carried out, one
for each value of Q for each value of P, thus P1Q1, P1Q2, P1Q3, P1Q4, P2Q1,
P2Q2 and so on.

          Q1      Q2      Q3      Q4
P1
P2
P3
P4

To the first order, our comparison of the Qs is independent of our
comparison of the Ps (ignoring interactions), and the results of such an experiment
could be tested for significance by carrying out the analysis of variance for rows
and columns.

However,  suppose  that  the  material  for  pressing  comes  from  the  previous 
(incorporation)  stage  in  homogeneous  batches  of  such  a  size  that  only  (say)  ten 
pressings  could  be  made  from  one  batch.  Then  we  could  not  carry  out  the  above 
design,  because  it  involves  16  pressings,  and  thus  part  of  the  series  would  be 
carried  out  on  one  batch  and  part  on  another  batch  and  possible  differences 
between  batches  would  be  hopelessly  mixed  up  with  the  P  and  Q  comparisons. 

This difficulty can be avoided by the use of a Latin Square design. Consider
the arrangement below, where A, B, C and D represent four different batches.
Thus with batch A, four readings are taken in which for the first P is at P1 and
Q at Q1, the second P at P2 and Q at Q4, etc. Similarly for the other batches.

          Q1      Q2      Q3      Q4
P1        A       B       C       D
P2        B       C       D       A
P3        C       D       A       B
P4        D       A       B       C

It  will  be  noticed  that  each  row  contains  A,  B,  C  and  D  each  once  and  only 
once.  Thus  whatever  disturbing  effects  possible  differences  between  the  batches 
may  introduce  occur  equally  for  each  row,  and  hence  are  completely  eliminated 
when  we  take  the  average  of  the  rows  for  our  comparison  of  the  Ps. 

Exactly similarly each column contains A, B, C and D each once and only
once and all disturbing effects due to differences between the batches are
eliminated from the comparison of the Qs.
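The defining property, that every batch occurs once and only once in each row and in each column, is easily checked by machine. The Python sketch below builds a square by cyclic shifting, which happens to reproduce the 4 x 4 arrangement shown above; the function names `cyclic_latin_square` and `is_latin_square` are assumptions of this sketch, and cyclic shifting is only one of many ways of forming a Latin Square.

```python
def cyclic_latin_square(labels):
    """Build a Latin square by shifting the labels one place further in each row."""
    n = len(labels)
    return [[labels[(row + col) % n] for col in range(n)] for row in range(n)]

def is_latin_square(square):
    """True if every symbol occurs exactly once in each row and each column."""
    n = len(square)
    symbols = set(square[0])
    rows_ok = all(len(row) == n and set(row) == symbols for row in square)
    cols_ok = all(set(col) == symbols for col in zip(*square))
    return len(symbols) == n and rows_ok and cols_ok

square = cyclic_latin_square(["A", "B", "C", "D"])
for label, row in zip(("P1", "P2", "P3", "P4"), square):
    print(label, " ".join(row))
print("Latin square:", is_latin_square(square))
```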

The results of such an experiment would be tested for significance with an
analysis of variance on the following lines:

(1) Square every observation and sum these squares.

(2) Obtain the total for each row, square these totals, sum these squares,
and divide this total by the number of individuals in each row.

(3) Similarly for columns.

(4) Similarly for batches.

(5) Square the grand total and divide by the grand number of individuals.

We then form a table of this analysis of variance (Table 12.9).

Table 12.9

Source of Variance    Sums of Squares    Degrees of Freedom    Mean Squares
Rows                     (2) - (5)            n - 1
Columns                  (3) - (5)            n - 1
Batches                  (4) - (5)            n - 1
Residual                                      (n - 1)(n - 2)
Total                    (1) - (5)            n^2 - 1

n = size of square.


The  Residual  Sum  of  Squares  is  as  usual  the  difference  between  the  Total  Sum 
of  Squares  and  all  the  other  components. 

The various mean squares are tested for significance with Fisher's variance
ratio test. Here we are principally interested in the Row and Column effects,
corresponding to our factors P and Q. If the effects prove not significant we
can regard it as there being no significant difference between the Ps (or the Qs).
If the effect proves significant, we can calculate the means for each level of P
(or Q). For making a detailed comparison of any two levels of P (or Q) each
mean has the standard error given by the square root of the Residual Mean Square
divided by the square root of n.

We have discussed the 4 x 4 Latin Square, but there is in general no
restriction upon square size. Designs are available for sizes up to 12 x 12.

One  of  the  assumptions  underlying  the  use  of  a  Latin  Square  is  that  there 
should  be  no  interaction  between  the  factors.  Since  the  existence  of  interaction 
in  industrial  chemical  systems  appears  to  be  common,  the  Latin  Square  may  not 
be  of  wide  applicability  in  this  field. 

An alternative method for dealing with the situation of restricted batch size,
where we need to carry out a given number of experiments for a factorial
experiment and the batch size of raw material is not large enough, is known as
confounding (see Chapter XIV).

The  data  in  Table  12.10  represents  the  results  of  an  exploratory  experiment 
on  the  first  stage  of  a  purification  process  which  involves  adsorbing  a  substance 
on  carbon.  Two  factors  were  to  be  investigated  at  five  levels  each,  pH  and  the 
quantity  of  carbon  as  a  percentage  of  the  volume  of  solution.  The  pH  was  made 
to  increase  in  equal  stages  and  the  quantity  of  carbon  in  equal  steps  on  a  logarithmic 
basis.  These  two  factors  form  the  rows  and  columns  respectively  of  Table  12.10. 
The  treatments  consist  of  five  different  grades  of  carbon. 
Table 12.10

                              % Carbon
             0.05      0.10      0.20      0.40      0.80     Totals
pH 4.0       17 A      39 D      65 B      19 C      12 E       152
pH 5.0       32 E      33 C      61 A      71 B      94 D       291
pH 6.0       56 C      49 A      84 D      90 E     100 B       379
pH 7.0       76 D      81 B      97 E      98 A     100 C       452
pH 8.0       93 B      90 E      97 C     100 D     100 A       480
Totals       274       292       404       378       406       1754

Grade           A       B       C       D       E
Grade Totals   325     410     305     393     321

The analysis of variance proceeds as follows:

(1) Square every observation and add.

$$17^2 + 39^2 + 65^2 + \dots + 100^2 = 144452$$

(2) Obtain the total for each row, square these totals, sum these squares, and
divide by the number of individuals in each row.

$$(152^2 + 291^2 + 379^2 + 452^2 + 480^2)/5 = 137226.0$$

(3) Similarly for columns.

$$(274^2 + 292^2 + 404^2 + 378^2 + 406^2)/5 = 126255.2$$

(4) Similarly for "treatments" (here grades).

$$(325^2 + 410^2 + 305^2 + 393^2 + 321^2)/5 = 124848.0$$

(5) Square the grand total and divide by the grand number of individuals.

$$1754^2/25 = 123060.6$$

We can now form a table of the analysis of variance (Table 12.11).

Table 12.11

Source of Variance      Sums of Squares          Degrees of Freedom        Mean Squares
Rows (pH)               (2) - (5) = 14165.4      n - 1 = 4                   3541.35
Columns (% carbon)      (3) - (5) =  3194.6      n - 1 = 4                    798.65
Treatments (grades)     (4) - (5) =  1787.4      n - 1 = 4                    446.85
Residual                             2244.0      (n - 1)(n - 2) = 12          187.00
Total                   (1) - (5) = 21391.4      n^2 - 1 = 24

n = size of square.


The  Residual  sum  of  squares  is  obtained  as  the  difference  between  the  Total 
Sum  of  Squares  and  all  the  other  components.  With  the  variance  ratio  test  we 
find  that  the  pH  effect  reached  the  0.1%  significance  level  and  the  %  Carbon 
effect  the  5%  level.     The  grade  effect  does  not  reach  the  5%  level. 
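The five steps can be followed through mechanically. The following is a minimal sketch in Python operating on the data of Table 12.10; the variable names and the layout of each cell as a (value, grade) pair are assumptions of this sketch.

```python
# Table 12.10: each cell is (percentage adsorbed, grade of carbon).
# Rows are pH 4.0 to 8.0; columns are 0.05, 0.10, 0.20, 0.40 and 0.80 % carbon.
square = [
    [(17, "A"), (39, "D"), (65, "B"), (19, "C"), (12, "E")],
    [(32, "E"), (33, "C"), (61, "A"), (71, "B"), (94, "D")],
    [(56, "C"), (49, "A"), (84, "D"), (90, "E"), (100, "B")],
    [(76, "D"), (81, "B"), (97, "E"), (98, "A"), (100, "C")],
    [(93, "B"), (90, "E"), (97, "C"), (100, "D"), (100, "A")],
]
n = len(square)   # side of the square

row_totals = [sum(value for value, _ in row) for row in square]
col_totals = [sum(value for value, _ in col) for col in zip(*square)]
grade_totals = {}
for row in square:
    for value, grade in row:
        grade_totals[grade] = grade_totals.get(grade, 0) + value

# Steps (1) to (5) of the text.
step1 = sum(value * value for row in square for value, _ in row)
step2 = sum(t * t for t in row_totals) / n
step3 = sum(t * t for t in col_totals) / n
step4 = sum(t * t for t in grade_totals.values()) / n
step5 = sum(row_totals) ** 2 / (n * n)

sums_of_squares = {
    "rows (pH)": step2 - step5,
    "columns (% carbon)": step3 - step5,
    "treatments (grades)": step4 - step5,
}
total_ss = step1 - step5
# Residual by difference (the right-hand side is evaluated before the key is added).
sums_of_squares["residual"] = total_ss - sum(sums_of_squares.values())

degrees_of_freedom = {
    "rows (pH)": n - 1,
    "columns (% carbon)": n - 1,
    "treatments (grades)": n - 1,
    "residual": (n - 1) * (n - 2),
}
mean_squares = {k: sums_of_squares[k] / degrees_of_freedom[k] for k in sums_of_squares}
variance_ratios = {k: mean_squares[k] / mean_squares["residual"]
                   for k in mean_squares if k != "residual"}

for k in sums_of_squares:
    print(f"{k:22s} {sums_of_squares[k]:10.1f} {degrees_of_freedom[k]:3d} {mean_squares[k]:10.2f}")
for k, ratio in variance_ratios.items():
    print(f"variance ratio, {k}: {ratio:.2f}")
```

Working without intermediate rounding gives figures that may differ from the tabulated ones by a unit or so in the last place.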

For making a comparison of any two levels of any of the three factors, we
may note that the standard error of any row, column, or treatment average is
$\sigma/\sqrt{n}$ where n is the side of the square. In the present case, therefore, the standard
error is $\sqrt{187.0}/\sqrt{5} = 6.12$. The difference of two such averages will therefore
have the standard error $\sqrt{2} \times 6.12 = 8.65$ and since the degrees of freedom are
12 we have for the 95% confidence limits $\pm 2.18 \times 8.65 = \pm 18.8$. It should
be remembered, of course, that these limits only refer to two averages chosen at
random; they do not apply to the largest and smallest chosen because they are
the largest and smallest.
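The arithmetic of these limits is sketched below in Python, taking the Residual Mean Square of 187.0 from Table 12.11 and the value 2.18 of t for 12 degrees of freedom as quoted above.

```python
import math

residual_mean_square = 187.0   # Residual of Table 12.11, 12 degrees of freedom
n = 5                          # side of the square
t_value = 2.18                 # 5% point of t for 12 degrees of freedom, as quoted

se_mean = math.sqrt(residual_mean_square) / math.sqrt(n)
se_difference = math.sqrt(2) * se_mean
confidence_limit = t_value * se_difference

print(f"standard error of an average    = {se_mean:.2f}")
print(f"standard error of a difference  = {se_difference:.2f}")
print(f"95% limits for a difference     = +/- {confidence_limit:.2f}")
```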

It is interesting to proceed a little further with this data and partition the 4
degrees of freedom for % carbon into linear, quadratic, cubic, and quartic
components in the manner indicated in Section (c). We obtain the result:

Linear         2450.0
Quadratic       198.9
Cubic            32.0
Quartic         513.6
Total          3194.5
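This partition can be sketched in Python as follows. The coefficients for five equally spaced levels are the standard orthogonal polynomial values of the kind tabulated by Fisher and Yates; they are not printed in the text, and the helper name `contrast_ss` is an assumption of this sketch. The 5% point of the variance ratio for 1 and 12 degrees of freedom (4.75) is likewise taken from standard tables.

```python
def contrast_ss(totals, coeffs, n_per_total):
    """Sum of squares for one component: (sum of k*T)^2 / (N * sum of k^2)."""
    value = sum(k * t for k, t in zip(coeffs, totals))
    return value ** 2 / (n_per_total * sum(k * k for k in coeffs))

# Orthogonal polynomial coefficients for five equally spaced levels.
FIVE_LEVELS = {
    "linear": (-2, -1, 0, 1, 2),
    "quadratic": (2, -1, -2, -1, 2),
    "cubic": (-1, 2, 0, -2, 1),
    "quartic": (1, -4, 6, -4, 1),
}

carbon_totals = [274, 292, 404, 378, 406]   # column totals of Table 12.10
residual_mean_square = 187.0                # Table 12.11, 12 degrees of freedom
F_5_PERCENT_1_12 = 4.75                     # tabulated 5% point for 1 and 12 d.f.

carbon_parts = {name: contrast_ss(carbon_totals, k, 5)
                for name, k in FIVE_LEVELS.items()}

for name, ss in carbon_parts.items():
    ratio = ss / residual_mean_square
    print(f"{name:10s} {ss:8.1f}   variance ratio {ratio:6.2f}"
          f"   reaches 5% level: {ratio >= F_5_PERCENT_1_12}")
print(f"{'total':10s} {sum(carbon_parts.values()):8.1f}")
```

The four components add up to the % carbon sum of squares, which serves as the usual check on the arithmetic.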

Remembering that the residual mean square is 187.0 with 12 degrees of freedom
we would find none of these significant except the linear component. We
might, therefore, conclude that there is no evidence for departure from linearity.
We reach this conclusion by postulating the hypothesis that the most reasonable
form of functionality is the straight line and after that the parabola and so on.
What we are overlooking is that from what is known of the theory of adsorption
this is not the most reasonable hypothesis. On the contrary it might be expected
that the percentage adsorption would rise to a maximum and stay at that maximum
no matter how much more carbon was used. In particular, of course, the
adsorption cannot exceed 100%. A glance at the column totals in Table 12.10 shows
that this is probably what is happening; the adsorption has risen to its maximum
by the time the % carbon has reached 0.20% and raising the % carbon to 0.80%
produces no further increase (the slight dip at 0.40% carbon is obviously
experimental error).
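The formal test behind the statement that only the linear component is significant can be sketched as follows. Each component carries one degree of freedom, so its sum of squares is also its mean square, and the ratio to the residual mean square is a variance ratio with (1, 12) degrees of freedom. The 5% point is taken here as the square of the t value 2.18 quoted earlier, which holds for a single degree of freedom in the numerator; the names are illustrative.

```python
residual_mean_square = 187.0      # 12 degrees of freedom, from the text
t_5pct_12df = 2.18                # tabulated t for 12 d.f., as quoted earlier
f_critical = t_5pct_12df ** 2     # 5% point of F(1, 12) equals t squared

# Sums of squares of the single-degree-of-freedom components, from the text
components = {"Linear": 2450.0, "Quadratic": 195.9, "Cubic": 32.0, "Quartic": 513.6}

ratios = {name: ss / residual_mean_square for name, ss in components.items()}
significant = [name for name, f in ratios.items() if f > f_critical]

for name, f in ratios.items():
    verdict = "significant" if f > f_critical else "not significant"
    print(f"{name:10s} variance ratio {f:6.2f}  ({verdict} at 5%)")
```

The test only asks whether a polynomial term is needed. As the text goes on to argue, it says nothing about whether a polynomial is the reasonable form to postulate in the first place.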

This  system  is  an  example,  therefore,  of  how  it  is  well  to  consider  whether 
the  hypothesis  that  one  is  testing  is  an  appropriate  one.  The  emphasis  which 
lies  upon  null  hypotheses  in  present-day  statistics  makes  it  rather  easy  to  fall  into 
an  attitude  of  mind  which  accepts  the  null  hypothesis  of  no  difference  between  the 
two  factors  and  denies  the  evidence  for  the  difference  between  the  two  factors 
when  it  would  be  more  reasonable  to  regard  as  one's  null  hypothesis  that  there 
was  a  difference  and  accept  that. 

It might also be emphasised that the experiment as carried out was exploratory
and not intended to be of great precision. For example, in its given form there
can be no detection of the existence of interactions. However, it served its purpose
of finding out what were likely to be the useful ranges of the variables pH and
quantity of carbon, and showed that it was unlikely there was any great difference
between the different grades of carbon. This was achieved with 25 runs, whereas
the full factorial experiment would have required $5^3 = 125$ runs. Having found
what are the useful ranges of the variables, these can be concentrated on if necessary,
and it may be that a $3^3$ experiment would be satisfactory using the three grades
of carbon that came out best. This could be confounded in 3 blocks of 9 (see
Chapter XIV) and would give enough degrees of freedom for error to be
reasonably accurate.

(h)  Graeco-Latin  and  Higher  Order  Squares 

Consider the Latin Square in Table 12.12. It is possible to construct another
Latin Square, using Greek letters (Table 12.13), such that when the two are
superimposed (Table 12.14) each Latin letter occurs once with every Greek
letter, and vice versa. Such a square is called "Graeco-Latin."

Table 12.12

A  B  C  D  E
B  C  D  E  A
C  D  E  A  B
D  E  A  B  C
E  A  B  C  D

Table 12.13

α  β  γ  δ  ε
δ  ε  α  β  γ
β  γ  δ  ε  α
ε  α  β  γ  δ
γ  δ  ε  α  β

Table 12.14

Aα  Bβ  Cγ  Dδ  Eε
Bδ  Cε  Dα  Eβ  Aγ
Cβ  Dγ  Eδ  Aε  Bα
Dε  Eα  Aβ  Bγ  Cδ
Eγ  Aδ  Bε  Cα  Dβ
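The defining property of a Graeco-Latin square can be checked mechanically. The sketch below builds the two 5 × 5 squares by modular arithmetic, taking the Latin letter in cell (i, j) as (i + j) mod 5 and the Greek letter as (3i + j) mod 5. This is one convenient construction that gives the arrangement of Tables 12.12 to 12.14, not necessarily how the tables were originally derived. It then confirms that all 25 Latin-Greek pairs are distinct.

```python
from itertools import product

n = 5
latin_letters = "ABCDE"
greek_letters = "αβγδε"

# Cell (i, j): Latin index (i + j) mod 5, Greek index (3*i + j) mod 5.
latin = [[(i + j) % n for j in range(n)] for i in range(n)]
greek = [[(3 * i + j) % n for j in range(n)] for i in range(n)]


def is_latin(square):
    """True if every symbol occurs exactly once in each row and each column."""
    symbols = set(range(len(square)))
    rows_ok = all(set(row) == symbols for row in square)
    cols_ok = all(set(col) == symbols for col in zip(*square))
    return rows_ok and cols_ok


# Superimpose the squares: each (Latin, Greek) pair must occur exactly once.
pairs = {(latin[i][j], greek[i][j]) for i, j in product(range(n), repeat=2)}
orthogonal = len(pairs) == n * n

superimposed = [
    [latin_letters[latin[i][j]] + greek_letters[greek[i][j]] for j in range(n)]
    for i in range(n)
]
for row in superimposed:
    print("  ".join(row))
print("Graeco-Latin:", is_latin(latin) and is_latin(greek) and orthogonal)
```

The same check applies to any pair of squares, however obtained: two Latin squares are suitable for superimposition only if the set of pairs has exactly n × n members.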

The  applications  of  this  to  experimental  design  are  obvious.  If  as  in  the 
previous  example  different  columns  were  different  angles,  the  rows  different 
pressures,  and  Latin  letters  different  batches,  then  the  Greek  letters  could  be  any 
further  factor  such  as  rate  of  pressing. 

The analysis of variance proceeds exactly as in the previous section, except
that now there is a further item derived exactly in the same way as the first three
in Table 12.11, again with $(n - 1)$ degrees of freedom. The Sum of Squares for
the Residual, obtained by difference as before, now has $(n - 1)(n - 3)$ degrees
of freedom.

It is possible to superimpose on Table 12.14 a fifth factor which still satisfies
the appropriate conditions, when the degrees of freedom for the Residual will
be $(n - 1)(n - 4)$, or in this case 4.
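The bookkeeping of degrees of freedom follows one rule: the total is n² − 1, and each classification laid on the square (rows, columns, and each alphabet) removes n − 1. A small sketch, with an illustrative function name:

```python
def residual_df(n, factors):
    """Residual degrees of freedom for a square of side n carrying `factors`
    mutually orthogonal classifications (rows and columns count as two)."""
    total_df = n * n - 1
    return total_df - factors * (n - 1)


for factors, label in [(3, "Latin square"), (4, "Graeco-Latin square"), (5, "fifth factor added")]:
    print(f"n = 5, {label:22s}: residual d.f. = {residual_df(5, factors)}")
```

For the 5 × 5 square this gives 12, 8 and 4 degrees of freedom, matching $(n - 1)(n - 2)$, $(n - 1)(n - 3)$ and $(n - 1)(n - 4)$.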

Generally speaking, except for n = 6, on a square of side n we can
superimpose n factors, still leaving $(n - 1)$ degrees of freedom for the residual.
These designs can thus be extremely useful for rough surveys: for example, in a
total of 25 runs we can get good estimates of 5 factors each at 5 levels. The
disadvantage, of course, is that they do not estimate interactions; nevertheless,
for a quick survey of a new field, in which we are principally interested in the
main effects and propose to examine later, in more detail, those which turn out
to be of practical importance, these designs are valuable. (4)

Fisher and Yates' Tables (Table XVI) give the structure of such squares
from n = 3 to 9. It is remarkable that for n = 6 nothing higher than a Latin
Square is possible.


(i)  The Theory of Chemical Sampling (5)

(i) The techniques outlined above can be made the basis of a scientific study
of sampling, a field that has not been adequately dealt with in chemical industry.

(4) An interesting example of the application of a 7 × 7 Graeco-Latin square to the
testing of motor fuel consumption is described by H. M. Davies, Journal of the Institute
of Petroleum, Vol. 32, p. 465, 1946, the four independent variables corresponding to rows,
columns, and the two alphabets being day, time of day, driver, and fuel.

(5) Interesting papers on sampling problems are: D. J. Finney, Biometrics Bulletin,
Vol. 2, No. 1, 1946, page 1; C. S. Pearce, Journal of the Royal Statistical Society, Vol.
CVII, page 117, 1944; F. Yates, Journal of the Royal Statistical Society, Vol. CIX, page 12,
1946. Chapter 17 of G. W. Snedecor's "Statistical Methods," 4th Edition (Iowa State
College Press), is also stimulating.

Thus consider the variances derived for the problem in Chapter XI (f):

Between Vats                        $\sigma_1^2$ = 1.09
Between Bags within Vats            $\sigma_2^2$ = 0.00
Between Samples within Bags         $\sigma_3^2$ = 1.85
Between Analyses within Samples     $\sigma_4^2$ = 0.83

Suppose we have one analysis on one sample from one bag from one vat.
This will have a variance of (1.09 + 0.00 + 1.85 + 0.83) = 3.77 or a standard
deviation of 1.94. It will then have a 95% chance of being less than 1.96 × 1.94
= 3.80 out from the true mean (1.96 is the value of t for infinite degrees of freedom
for the 5% level of significance).

If  this  is  too  inaccurate  an  estimate  of  the  quality  of  the  batch,  we  might 
reduce  the  error  in  our  estimate  by  doing  duplicate  analyses.  It  would  then 
have  a  variance  of  (1.09  +  0.00  +  1.85  +  0.83/2)  =  3.35  or  a  standard  deviation 
of  1.83.  It  is  clear  that  this  is  very  little  improvement  upon  our  original  estimate 
of  1.94  based  on  a  single  analysis. 

We might take a pair of samples from a bag, and perform single analyses
upon them. Our estimate would thus have a variance of (1.09 + (1.85 + 0.83)/2)
= 2.43, or a standard deviation of 1.56. This is a definite improvement upon our
earlier estimates.

Alternatively we might take a pair of samples, one from one bag from one
vat and the other from another bag from another vat. Our estimate of the mean
batch quality would now have a variance of (1.09 + 0.00 + 1.85 + 0.83)/2 = 1.88
or a standard deviation of 1.37.

It is thus clear that, if we are going to have the laboratory doing two
analyses, it is much less profitable to have them doing duplicate analyses upon
the same sample, or separate analyses upon two samples from the same bag, than
to have them doing separate analyses upon samples drawn from separate bags
from separate vats.

If we are dealing with a chemical substance, samples of which can be blended,
such as liquids, powders, etc., then a further possibility is open to us. We can
take a pair of samples, one from one bag from one vat and one from one bag
from another vat, blend them, and carry out one analysis upon the blended sample.
Our estimate would now have an error variance of ((1.09 + 0.00 + 1.85)/2 +
0.83) = 2.30 or a standard deviation of 1.52. Or if we took four samples from four
vats and blended them and carried out one analysis upon the blended sample
our estimate would have an error variance of ((1.09 + 0.00 + 1.85)/4 + 0.83) =
1.56 or a standard deviation of 1.25. Thus with a little extra trouble in sampling,
and no more work on the part of the laboratory, we have got an estimate of the
batch mean quality with an error standard deviation of 1.25, to be compared with
the simple estimate from one sample of 1.94.

It  is  on  the  basis  of  such  an  exploratory  analysis  of  variance  that  we  can 
deduce  what  is  the  error  of  our  present  sampling  scheme  and  if  the  latter  proves 
to  have  too  high  an  error  the  analysis  of  variance  will  show  the  most  economical 
method  of  improving  its  accuracy. 
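The comparisons above all follow one rule: each component of variance is divided by the number of distinct units at that level which enter the estimate. The sketch below tabulates the schemes discussed, using the four components quoted from Chapter XI (f). The function name and the way a scheme is described by four counts are illustrative, and a blended sample is assumed to behave, for analysis, as the average of its constituent samples.

```python
import math

# Components of variance quoted in the text.
COMPONENTS = {"vats": 1.09, "bags": 0.00, "samples": 1.85, "analyses": 0.83}


def scheme_variance(vats, bags, samples, analyses):
    """Variance of the batch estimate when it averages over the given total
    numbers of distinct vats, bags, samples and analyses."""
    counts = {"vats": vats, "bags": bags, "samples": samples, "analyses": analyses}
    return sum(COMPONENTS[level] / counts[level] for level in COMPONENTS)


schemes = {
    "one analysis on one sample": (1, 1, 1, 1),
    "duplicate analyses on one sample": (1, 1, 1, 2),
    "two samples from one bag, one analysis each": (1, 1, 2, 2),
    "two samples from separate vats, one analysis each": (2, 2, 2, 2),
    "two samples from separate vats, blended, one analysis": (2, 2, 2, 1),
    "four samples from four vats, blended, one analysis": (4, 4, 4, 1),
}

for name, counts in schemes.items():
    variance = scheme_variance(*counts)
    print(f"{name:55s} variance {variance:5.2f}   s.d. {math.sqrt(variance):4.2f}")
```

Laid out this way it is easy to see which divisor a proposed change in the sampling scheme actually alters, and hence whether it attacks the largest component.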

(ii) Further examples of the application of the analysis of variance to the
study of chemical sampling are given below. An allegedly homogeneous blend
had been bagged. To test its homogeneity three analyses were performed on
each of three samples taken from each of three bags from the blend. The
material was nitrocellulose, and it was analysed for nitrogen content and viscosity.

Nitrogen (%)

Sample           Bag 1                   Bag 2                   Bag 3
I          12.20  12.22  12.21     12.23  12.24  12.23     12.19  12.21  12.22
II         12.21  12.20  12.19     12.22  12.23  12.25     12.24  12.21  12.23
III        12.21  12.22  12.20     12.22  12.21  12.24     12.22  12.22  12.22

Viscosity (poises)

Sample         Bag 1             Bag 2             Bag 3
I          116  112  114     124  130  126     109  128  119
II         128  127  132     134  114  127     114  127  119
III        116  118  117     126  129  132     119  119  130

The data was analysed as in Chapter XI (i), with results as below:

                                   Degrees of        Nitrogen              Viscosity
Source of Variance                 Freedom       Sums of    Mean       Sums of     Mean
                                                 Squares   Squares     Squares    Squares
Between Bags                           2          24.51     12.25       267.56    133.78
Between Samples within Bags            6          10.23      1.70       427.11     71.19
Between Analyses within Samples       18          25.33      1.40       614.00     34.11
Total                                 26          60.07                1308.67

Units for nitrogen: hundredths of a per cent.

In the case of Nitrogen, the Between Samples Mean Square is not significant
but the Between Bags Mean Square is, at the 1% level. In the case of Viscosity,
the Between Samples Mean Square is not significant but the Between Bags Mean
Square is nearly significant at the 5% level.
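The sums of squares in the table can be recomputed directly from the 27 observations. The sketch below does so for both properties. The layout of the lists (bag, then sample, then analysis) and the function name are illustrative, and nitrogen is coded in hundredths of a per cent above 12.00, which leaves the sums of squares unchanged.

```python
# data[bag][sample] holds the three analyses made on that sample.
# Nitrogen is coded as hundredths of a per cent above 12.00 (12.21 -> 21).
nitrogen = [
    [[20, 22, 21], [21, 20, 19], [21, 22, 20]],   # bag 1, samples I to III
    [[23, 24, 23], [22, 23, 25], [22, 21, 24]],   # bag 2
    [[19, 21, 22], [24, 21, 23], [22, 22, 22]],   # bag 3
]
viscosity = [
    [[116, 112, 114], [128, 127, 132], [116, 118, 117]],
    [[124, 130, 126], [134, 114, 127], [126, 129, 132]],
    [[109, 128, 119], [114, 127, 119], [119, 119, 130]],
]


def nested_anova(data):
    """Return {source: (degrees of freedom, sum of squares)} for a balanced
    bags / samples-within-bags / analyses-within-samples classification."""
    bags, samples, analyses = len(data), len(data[0]), len(data[0][0])
    values = [x for bag in data for sample in bag for x in sample]
    correction = sum(values) ** 2 / len(values)

    total_ss = sum(x * x for x in values) - correction
    bag_totals = [sum(x for sample in bag for x in sample) for bag in data]
    bag_ss = sum(t * t for t in bag_totals) / (samples * analyses) - correction
    sample_totals = [sum(sample) for bag in data for sample in bag]
    between_all_samples_ss = sum(t * t for t in sample_totals) / analyses - correction

    return {
        "bags": (bags - 1, bag_ss),
        "samples": (bags * (samples - 1), between_all_samples_ss - bag_ss),
        "analyses": (bags * samples * (analyses - 1), total_ss - between_all_samples_ss),
    }


for name, data in [("Nitrogen", nitrogen), ("Viscosity", viscosity)]:
    print(name)
    for source, (df, ss) in nested_anova(data).items():
        print(f"  {source:9s} d.f. {df:2d}   S.S. {ss:8.2f}   M.S. {ss / df:7.2f}")
```

The results agree with the printed table to within a unit in the last place.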

It is thus apparent that there is appreciable inhomogeneity in the blend
from bag to bag but not within bags. We can calculate the components of variance
as below:

Source of Variance      Nitrogen    Viscosity
Between Bags              1.20        10.0
Between Samples           0.00         0.0
Residual                  1.48        43.4
Total                     2.68        53.4

The  figure  under  "Total"  is  the  variance  of  a  single  analysis  upon  a  sample 
from  one  bag. 
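The components shown can be recovered from the analysis of variance table under one reading, which is an inference from the figures rather than something stated in the text: since the Between Samples term is not significant, it is pooled with the Between Analyses term to give a Residual on 24 degrees of freedom, and the Between Bags component is the excess of the Bags mean square over this residual, divided by the 9 analyses per bag. A sketch with illustrative names:

```python
def variance_components(ss_bags, ss_samples, ss_analyses,
                        df_bags=2, df_samples=6, df_analyses=18,
                        analyses_per_bag=9):
    """Pool the two lower terms into a residual, then estimate the
    between-bags component from the Between Bags mean square."""
    residual = (ss_samples + ss_analyses) / (df_samples + df_analyses)
    bags_mean_square = ss_bags / df_bags
    between_bags = max(0.0, (bags_mean_square - residual) / analyses_per_bag)
    return between_bags, residual


# Sums of squares as printed in the analysis of variance table
nitrogen_bags, nitrogen_residual = variance_components(24.51, 10.23, 25.33)
viscosity_bags, viscosity_residual = variance_components(267.56, 427.11, 614.00)

print(f"Nitrogen:  between bags {nitrogen_bags:.2f}, residual {nitrogen_residual:.2f}")
print(f"Viscosity: between bags {viscosity_bags:.1f}, residual {viscosity_residual:.1f}")
```

That this reproduces the printed values of 1.20 and 1.48 for nitrogen, and 10.0 and 43.4 for viscosity, supports the reading but does not prove it was the procedure used.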

To effect improvement in the nitrogen figure, we need to concentrate
attention upon both operating sources of error. It is not sufficient to take n samples
and bulk them and perform one analysis on the blended sample: thus if n was
4, the variance of the estimate thus obtained would be (1.48 + 1.20/4) = 1.78.
This is an improvement, but no matter how many samples we bulk, so long as
we perform only one analysis the variance will never be less than 1.48. If we
performed two analyses upon a blended sample derived from four bags, the
variance of the mean would be (1.20/4 + 1.48/2) = 1.04.

With the viscosity determination, it is clear that our main efforts must be
directed at obtaining more analyses. If we had four analyses upon one sample
the variance of our estimate of the mean would be (10 + 43.4/4) = 20.8, or four
analyses upon separate samples gives a variance of (10/4 + 43.4/4) = 13.3.
The gain by taking separate samples is appreciable, therefore.


(j)  The  Homogeneity  of  Data 

At  the  end  of  Chapter  III  on  the  comparison  of  means  it  was  mentioned 
that  complications  arise  in  the  calculation  of  standard  deviations  if  the  observations 
are  not  independent  of  each  other. 

In general, with an analysis of variance between and within batches (One
Factor with Replication), if we have n batches of m individuals then let us define
the simple variance given by the Total Mean Square (i.e. the variance we would
calculate if we ignored the inhomogeneity of the data and regarded it as referring
to nm independent individuals) as $\sigma_t^2$. Then

$$\sigma_t^2 = \frac{m(n - 1)}{mn - 1}\,\sigma_a^2 + \sigma_b^2$$

where $\sigma_a^2$ and $\sigma_b^2$ are the components of variance corresponding
to differences between batches and differences within batches.

Rearranging, this can be expressed as

$$\sigma_t^2 = \left[1 - \frac{m - 1}{mn - 1}\right]\sigma_a^2 + \sigma_b^2$$

It is evident that $\sigma_t^2$ is always smaller than
$\sigma_T^2 = \sigma_a^2 + \sigma_b^2$, and the error will be greatest when
$\sigma_b^2/\sigma_a^2$ tends to 0 and when m is large and n = 2. Under
these circumstances $\sigma_t^2$ is only half $\sigma_T^2$, the correct value for
the total variance.

Consider the block of 30 observations discussed in Chapter VII (b), Analysis
Between and Within Columns. Suppose we wished to calculate the standard
deviation of the mean of these observations.

If we ignored the fact that these 30 observations are not independent of each
other, but are actually 10 groups, each group being 3 observations, we should
calculate the variance from its ordinary definition, which corresponds to terms
(1) − (3) in that section divided by the total degrees of freedom, i.e. 203.87/29
= 7.030, and the standard deviation of the individual observations would be
2.65, and the standard deviation of the mean would be $2.65/\sqrt{30} = 0.484$.

However, this is a false result, for our mean is not the mean of 30 independent
individuals, but actually the mean of 10 column means.

The variance of each column mean is $(\sigma_B^2/3 + \sigma_A^2)$ and since there
are 10 column means the variance of the grand mean is
$(\sigma_B^2/3 + \sigma_A^2)/10$, i.e. (2.17/3 + 5.22)/10 = 0.594. The standard
deviation of the grand mean is thus $\sqrt{0.594} = 0.771$.

This  true  standard  error  of  the  grand  mean  of  0.771  is  very  much  larger 
than  the  false  one  of  0.484  we  obtained  by  ignoring  the  inhomogeneity  of  the 
data.  If  we  had  been  using  the  standard  error  to  compare  the  mean  with  some 
other  value  using  the  Student  t  test  we  would  have  fallen  into  serious  errors,  for 
not  only  is  the  true  value  of  the  standard  error  of  the  grand  mean  much  larger 
than  the  incorrect  one,  but  also  the  true  degrees  of  freedom  are  only  9,  not  29. 
This  is  why  it  is  in  general  very  desirable  to  plan  experiments  of  this  type  in  a 
form  suitable  for  the  application  of  the  analysis  of  variance. 
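The two standard errors can be set side by side numerically. The sketch below uses the components quoted above (5.22 between columns, 2.17 within) for 10 groups of 3 observations. It first forms the variance one would obtain by ignoring the grouping, taking its expectation as the between-groups component weighted by (observations per group) × (groups − 1) / (total observations − 1) plus the within-groups component, and then the correct variance of the grand mean. Variable names are illustrative.

```python
import math

groups, per_group = 10, 3
var_between, var_within = 5.22, 2.17      # components of variance from the text
n_obs = groups * per_group

# Variance obtained by treating all 30 observations as independent
naive_variance = per_group * (groups - 1) / (n_obs - 1) * var_between + var_within
naive_se = math.sqrt(naive_variance / n_obs)

# Correct treatment: the grand mean is the mean of 10 group means
true_se = math.sqrt((var_within / per_group + var_between) / groups)

print(f"variance ignoring the grouping:  {naive_variance:.3f}")
print(f"false standard error of mean:    {naive_se:.3f}  (nominally {n_obs - 1} d.f.)")
print(f"true standard error of mean:     {true_se:.3f}  ({groups - 1} d.f.)")
```

The first figure reproduces the 7.030 obtained from 203.87/29, which is a useful check that the components and the total mean square are consistent with one another.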

Thus, if the thirty units (ten batches) referred to had been made of one
consignment of raw material, and we wished to compare their grand mean with
that of another consignment, it would be most convenient to make ten batches
(thirty units) with the second consignment. This would then be suitable for an
analysis of variance as below:

Source of Variance                       Degrees of Freedom      Sums of    Mean
                                                                 Squares   Squares
Between Consignments                     (2 − 1) = 1
Between Batches within Consignments      2(10 − 1) = 18
Between Units within Batches             2 × 10(3 − 1) = 40
Total                                    2 × 10 × 3 − 1 = 59

It  will  be  noted  that  it  would  not  be  possible  to  apply  this  technique  if  the 
experiment  had  been  done  haphazardly,  unplanned,  so  that  units  were  missing 
from  some  of  the  batches  and  the  numbers  of  batches  made  from  each  consignment 
were  unequal.  (6) 


(k)  The  Use  of  Logarithms  in  the  Analysis  of  Variance 

We  discussed  earlier  the  possibility  of  using  the  logarithm  of  the  dependent 
variable  rather  than  the  dependent  variable  itself  in  cases  where  deviations  were 
likely  to  be  proportional  to  the  mean,  i.e.  constant  if  we  use  logarithms. 

An example of this occurs in the case of acid plant throughputs discussed
in Chapter XI (i) (Table 11.22). The Between Factories Within Acid Group
term is not significant, so we can form a pooled residual variance with 3 × 6 = 18
degrees of freedom for each type of weak acid. These become 57.19, 405.21,
and 1197.84 respectively. These are severely inconsistent when tested with the
variance ratio test. However, the standard deviations are 7.56, 20.13, and 34.61,
and when expressed as fractions of the means 26.7, 51.0, and 103.3, we get 0.294,
0.396 and 0.334 respectively. It seems that these fractions are roughly constant,
and we are encouraged to try an analysis of the logarithms of the data.

(6) Of course, incomplete analyses can be carried out, in the simpler cases, e.g. Section
VII (d), without excessive difficulty. In the more complex cases, however, the
computational labour becomes excessive.

The natural logarithm of one-tenth of the throughput was selected as being
the most convenient function to handle. Residual variances of 0.04489, 0.22450,
and 0.08891 were obtained, each with 18 degrees of freedom, for the three
acid types as before. While not strictly consistent, these are much more so than
those of the simple variable. The Analysis of Variance on the logarithm of the
data is in Table 12.15.

Table 12.15
Analysis of Variance of Logarithm of Acid Plant Throughputs.

Source of Variance     Degrees of     Sums of      Mean
                       Freedom        Squares     Squares
Type of weak acid          2          11.4670      5.7335
Factories                  6           0.6769      0.1128
Within Factories          54           6.4494      0.1194
Total                     62          18.5933

The  conclusions  to  be  reached  from  the  above  analysis  are  identical  with 
those  obtained  from  Table  11.23,  the  analysis  of  the  simple  data. 

This  is  a  reassuring  aspect  of  the  analysis  of  variance  :  we  can  disregard  the 
underlying  assumptions  with  comparative  impunity. 
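How much the logarithmic transformation improves the consistency of the three residual variances can be gauged from the ratio of the largest to the smallest, before and after. The sketch below uses only figures quoted in this section. It also checks that the three logarithmic variances, each on 18 degrees of freedom, pool to the Within Factories mean square of Table 12.15. Names are illustrative.

```python
# Pooled residual variances for the three types of weak acid (18 d.f. each)
raw_variances = [57.19, 405.21, 1197.84]        # simple variable
log_variances = [0.04489, 0.22450, 0.08891]     # natural log of throughput / 10

raw_ratio = max(raw_variances) / min(raw_variances)
log_ratio = max(log_variances) / min(log_variances)

# With equal degrees of freedom the pooled variance is the simple average
pooled_log_variance = sum(log_variances) / len(log_variances)

print(f"largest / smallest variance, simple variable: {raw_ratio:.1f}")
print(f"largest / smallest variance, logarithms:      {log_ratio:.1f}")
print(f"pooled variance of logarithms (54 d.f.):      {pooled_log_variance:.4f}")
```

The ratio falls from roughly 21 to roughly 5, which is the sense in which the logarithms are "much more" consistent without being strictly so.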

(l)  Other Transformations in the Analysis of Variance (7)

Sometimes  in  an  analysis  of  variance  the  dependent  variable  is  not  a  continuous 
variable  but  discontinuous. 

For example, in discussing blemishes in objects, the number of blemishes
of course has the lower limit zero but can go to very large numbers. Technically,
the distribution is probably Poissonian. Under these circumstances it is more
appropriate to use the square root of the number of blemishes as the dependent
variable (8). Where the numbers of blemishes are in many cases less than 10, it
is slightly preferable to use $\sqrt{x + 1/2}$, where x is the number of blemishes.

In the case where the number of incidents x being counted has a finite upper
limit, being able to range from 0 to X, it is best to take the angle $\theta$:

$$\theta = \sin^{-1}\sqrt{x/X}$$

For example, if x = 40, X = 120, then $\sqrt{x/X} = \sqrt{1/3} = 0.5774$, and
$\sin^{-1} 0.5774 = 35.2°$.

A  convenient  table  of  the  transformation  is  reproduced  in  the  Appendix 
(Table  VI). 
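Where a table is not to hand, both transformations are easily computed. A minimal sketch follows. The function names are illustrative, and the angle is returned in degrees to match the worked example.

```python
import math


def sqrt_transform(count, small_counts=False):
    """Square-root transformation for counts; when many counts are below
    about 10 the text prefers sqrt(x + 1/2)."""
    return math.sqrt(count + 0.5) if small_counts else math.sqrt(count)


def angular_transform(x, upper):
    """Angle in degrees whose sine is sqrt(x / upper), for a count x that
    can range from 0 to `upper`."""
    return math.degrees(math.asin(math.sqrt(x / upper)))


print(f"sqrt transform of 7 blemishes:         {sqrt_transform(7):.3f}")
print(f"with the 1/2 correction:               {sqrt_transform(7, small_counts=True):.3f}")
print(f"angular transform of 40 out of 120:    {angular_transform(40, 120):.2f} degrees")
```

For x = 40 and X = 120 this gives about 35.26 degrees, in line with the example above.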


(m)  Missing Values (9)

Occasionally runs fail to produce results: the sample may get lost or any
similar accident may happen. In the case of a two factor experiment, of r rows
and c columns, one missing value may be efficiently estimated by the formula (10)

$$X = \frac{rR + cC - T}{(r - 1)(c - 1)}$$

where   r = number of rows
        c = number of columns
        R = total for the row with the missing item
        C = total for the column with the missing item
        T = grand total

Thus, in Table 12.1, if the value of $T_2W_2$ (actually the observed value was 27)
had been missing, we would have estimated it as

$$X = \frac{4 \times 131 + 4 \times 78 - 481}{(4 - 1)(4 - 1)} = 39.4$$

(7) A very interesting paper on the effects of various transformations in a 3 × 3 × 2 × 2
experiment is by Thorp et al: Phytopathology XXXI, page 26, 1941. A recent survey
of this field is by M. S. Bartlett: Biometrics, Vol. 3, No. 1, page 39, 1947.

(8) M. S. Bartlett: Supplement to the Journal of the Royal Statistical Society, 3: 68
(1936).

(9) A full discussion with bibliography of missing plot techniques is by R. L. Anderson:
Biometrics Bulletin, Vol. 2, No. 3, page 41, 1946.

In  carrying  out  the  analysis  of  variance,  this  value  is  inserted  as  though  it 
had  actually  been  observed,  and  the  analysis  then  proceeds  normally.  However, 
the  degrees  of  freedom  for  the  Residual  and  for  the  Total  are  both  decreased  by 
unity. 
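The estimate for a single missing value is a one-line calculation. The sketch below applies the formula (rR + cC − T)/((r − 1)(c − 1)) to the totals quoted for the Table 12.1 example, where R, C and T are the totals of the values actually observed. The function name is illustrative.

```python
def missing_value(rows, cols, row_total, col_total, grand_total):
    """Estimate of one missing cell in a two-factor table from the totals
    of the observed values in its row, its column, and the whole table."""
    return (rows * row_total + cols * col_total - grand_total) / ((rows - 1) * (cols - 1))


# Totals quoted in the text for the 4 x 4 example
estimate = missing_value(rows=4, cols=4, row_total=131, col_total=78, grand_total=481)
print(f"estimated missing value: {estimate:.1f}")

# Degrees of freedom after inserting one estimated value in a 4 x 4 table
residual_df = (4 - 1) * (4 - 1) - 1
total_df = 4 * 4 - 1 - 1
print(f"residual d.f. {residual_df}, total d.f. {total_df}")
```

The second part records the adjustment described above: one degree of freedom is taken from both the Residual and the Total for each value inserted.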

It is wiser to regard conclusions from an experiment with a missing
observation with rather more caution than if it had been complete. That is to say, we
had better only attach something of the order of a 5% significance to what
apparently is a 1% significance.

When two observations are missing, they can be inserted by an iterative
method. We insert for $X_1$ an assumed value, say the grand mean, and then
calculate the expected value of $X_2$. Using this value of $X_2$, we delete the assumed
value for $X_1$, and then calculate its expected value, say $X_1'$. Using this value of
$X_1'$ we delete the previous value $X_2$ and recalculate its new value $X_2'$. Using this
new value $X_2'$ we delete $X_1'$ and recalculate a new value for $X_1$, $X_1''$. After a
very few cycles of this process the new values will be the same as the old. The
analysis of variance then proceeds with two degrees of freedom subtracted from
both the Residual and the Total.
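The cycle just described can be sketched as follows. The single-value formula (rR + cC − T)/((r − 1)(c − 1)) is applied to each missing cell in turn, the other being held at its latest estimate, until the values settle. The 4 × 4 table used here is hypothetical, made up as an exactly additive table (row value plus column value) so that the correct answers are known in advance. The function names and the convergence tolerance are likewise illustrative.

```python
def yates_estimate(table, i, j):
    """Estimate cell (i, j) from the rest of the table; whatever currently
    sits in that cell is excluded from the row, column and grand totals."""
    r, c = len(table), len(table[0])
    row_total = sum(table[i]) - table[i][j]
    col_total = sum(row[j] for row in table) - table[i][j]
    grand_total = sum(map(sum, table)) - table[i][j]
    return (r * row_total + c * col_total - grand_total) / ((r - 1) * (c - 1))


def fill_two_missing(table, cell1, cell2, tolerance=1e-9, max_cycles=50):
    """Insert two missing values (marked None) by the iterative method."""
    work = [list(row) for row in table]
    observed = [x for row in work for x in row if x is not None]
    (i1, j1), (i2, j2) = cell1, cell2
    work[i1][j1] = sum(observed) / len(observed)   # assumed value for X1: grand mean
    work[i2][j2] = 0.0                             # placeholder, excluded from its own estimate
    for _ in range(max_cycles):
        work[i2][j2] = yates_estimate(work, i2, j2)    # X2 given the current X1
        new_x1 = yates_estimate(work, i1, j1)          # X1 given the new X2
        change = abs(new_x1 - work[i1][j1])
        work[i1][j1] = new_x1
        if change < tolerance:
            break
    return work[i1][j1], work[i2][j2]


# Hypothetical additive table; the true values of the missing cells are 11 and 34.
table = [
    [None, 12, 13, 14],
    [21, 22, 23, 24],
    [31, 32, 33, None],
    [41, 42, 43, 44],
]
x1, x2 = fill_two_missing(table, (0, 0), (2, 3))
print(f"X1 = {x1:.4f}, X2 = {x2:.4f}")
```

With real data the inserted values will not, of course, reproduce any "true" figure; the additive table is used only because it makes the behaviour of the iteration easy to verify.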

It is obvious that an inserted value can never be as good as an observed one,
and so every effort should be made to avoid having to handle incomplete data.
Where the lack of balance is on a large scale, the simplest case can be handled
rigorously with little increase in trouble as in Chapter VII (d). Snedecor (11) (pages
281-301) discusses some more complicated instances. However, in practice we
can frequently obtain a moderately satisfactory result by discarding sufficient data
randomly from the larger classes so that we have only one or two missing values
which can be inserted by the methods above.

(n)  The  Assumptions  Underlying  the  Analysis  of  Variance 

Eisenhart (12) and Cochran (13) have recently discussed the practical
significance of the assumptions underlying the usual analysis of variance. Several of
these relate to the residual variance.

It is assumed firstly that the residual errors have the Normal or Gaussian
distribution. The effects of non-normality have been studied only in certain
relatively simple cases and in these it was found that one was not likely to be led
into serious errors by the moderate types of non-normality usually encountered
in practice. It can be reasonably assumed that this will also be true for the more
complex cases occurring in the analysis of variance. It is, however, preferable,
when we know that our dependent variable is non-normally distributed and when
we know which transformation (generally of the type discussed in the previous
two sections) will make it normal, to employ it.

(10) F. Yates: Empire Journal of Experimental Agriculture, 1, 129, 1933.

(11) G. W. Snedecor: "Statistical Methods applied to Experiments in Agriculture and
Biology," (Iowa State College Press), 4th Edition, 1946.

(12) Churchill Eisenhart, "The Assumptions Underlying the Analysis of Variance,"
Biometrics, Vol. 3, page 1, March, 1947.

(13) W. G. Cochran, "Some Consequences when the Assumptions for the Analysis of
Variance are not Satisfied," Biometrics, Vol. 3, page 22, March, 1947.

It is further assumed that the experimental errors have a common variance
(in the case of the simpler analyses: there are some, the so-called split-plot
experiments, in which in effect there are several errors involved). A common
source of departure from this assumption is where the error is correlated with the
variable. An example of this was discussed in Section (k) where this effect was
very marked but nevertheless made little difference to the results of the analysis
of variance. Another circumstance that would lead to non-satisfaction of this
assumption is for the blocks in a confounded experiment to correspond to different
batches of raw material with varying degrees of homogeneity. Or it may be that
one particular level of a factor, or combination of levels of several factors, leads
to an abnormal pH region in which results tend to be more erratic. It is
comparatively rarely that one has sufficient observations to detect this sort of
phenomenon, and the most one can usually do is to hope, in the main reasonably, that
if it is present it will not distort our significance levels unduly.

A  further  assumption  is  that  the  experimental  errors  are  independent.  In 
experimental  work  this  condition  can  be  satisfied  by  the  use  of  the  randomisation 
technique  outlined  in  Chapter  I  (n). 

In randomised block experiments and in confounded factorial experiments,
it is assumed that block × treatment interactions do not exist. That is to say,
it is assumed that treatment effects and block effects are additive. In the case of
randomised blocks this would be easy enough to check, by explicitly calculating
the block × treatment interaction, but this procedure cannot be used in a confounded
factorial experiment. It would seem, however, that departures from this
assumption would have to be severe before they led us into serious error.


CHAPTER  XIII 
BALANCED  INCOMPLETE  BLOCKS 

(a)  Introduction 

A general description of the experimental design known as balanced
incomplete blocks has been given in Chapter I Section (f), and will not be repeated
here. This chapter will give the methods of computation for such experiments,
the method and notation being based on Yates (1).

Yates has developed a rather more sensitive method of analysing the results (2)
but generally, particularly when the number of blocks in the experiment is not very
large, the increase in accuracy is not very great, and the earlier method of analysis
reproduced here will normally suffice.

(b)  Computation 

Table 13.1 gives the results of an experiment for comparing 7 treatments
in 7 blocks of 3 units, there thus being 3 replications of each treatment. Block A,
for example, contained treatments $T_1$, $T_3$, and $T_5$, and the figures in the table
in the columns A to G are the yields obtained.

(1) F. Yates: Annals of Eugenics, Vol. 7, page 121, 1936.
(2) F. Yates: Annals of Eugenics, Vol. 10, p. 317, 1940.

Table 13.1

                        Blocks                                             Corrected
Treatments    A    B    C    D    E    F    G   Totals      Q         v  Treatment Means
T1           50   42   91                          183    -84   -12.000           57.238
T2                    118   94   94                306    163    23.285           92.523
T3           76             64        80           220     69     9.857           79.095
T4                     72             53   31      156   -160   -22.857           46.381
T5           44                  65        54      163    -81   -11.571           57.667
T6               102            119   92           313    254    36.286          105.524
T7                38        38             37      113   -161   -23.000           46.238
Totals      170  182  281  196  278  225  122     1454      0     0.000

In Yates' notation:

Number of treatments                                  t = 7
Number of units per block                             k = 3
Number of replications of each treatment              r = 3
Number of blocks                                      b = 7
Total number of units                                 N = tr = bk = 21
Sum of all N observations                             G = 1454
Sum of all r observations for treatment 1             $T_1$
Sum of all k observations for block 1                 $B_1$
Error variance of a single observation within
blocks of k units                                     $\sigma_k^2$
For each of the t treatments we calculate the quantities Q given by

$$Q_1 = kT_1 - B_1 - B_2 - \dots - B_r$$

where the Bs for $Q_1$ are the totals for the r blocks containing treatment 1. For
example,

$$Q_1 = 3 \times 183 - 170 - 182 - 281 = -84$$

The Sum of Squares for treatments is then

$$\frac{t - 1}{Nk(k - 1)} \sum Q^2 = \frac{7 - 1}{21 \times 3(3 - 1)}\,(84^2 + 163^2 + \dots + 161^2) = 7665.905$$

The Sum of Squares for Blocks is obtained from the Block totals in the usual
manner:

$$\frac{1}{k} \sum B^2 - G^2/N = \frac{1}{3}\,(170^2 + 182^2 + \dots + 122^2) - 1454^2/21 = 6725.810$$

The Total Sum of Squares is also as usual

$$50^2 + 42^2 + 91^2 + 118^2 + \dots + 37^2 - 1454^2/21 = 15057.810$$

We enter up these items in a table of analysis of variance and obtain the Error
Sum of Squares and Degrees of Freedom by difference.


Table  13.2 


Source  of 
Variance 

Degrees  of 
Freedom 

Sums  of 
Squares 

Mean 
Squares 

Blocks 

Treatments 

Error 

6 
6 

8 

6725.810 

7665.905 

666.095 

1120.968 

1277.651 

83.261 

Total 

20 

15057.810 
The Error Mean Square is σk² and the standard error of the difference between
any two treatment means is

sqrt(2k(t - 1)σk² / (N(k - 1))) = sqrt(6 x 83.261/7) = 8.448
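The arithmetic of Table 13.2 can be checked in a few lines. The sketch below is an illustration, not part of Yates' procedure: it takes the block totals and the Treatment and Total sums of squares quoted above as given (the individual observations are not repeated here), forms the Block sum of squares, obtains the Error line by difference, and evaluates the standard error using the variance 2k(t - 1)σk²/(N(k - 1)), which is 6σk²/7 for this design. All variable names are our own.

```python
# Quantities quoted in the text for the example with 7 treatments in 7 blocks of 3.
t, k, r, b = 7, 3, 3, 7
N = t * r                                   # 21 units in all
block_totals = [170, 182, 281, 196, 278, 225, 122]
G = sum(block_totals)                       # grand total, 1454

correction = G**2 / N                       # usual correcting factor G^2/N
ss_blocks = sum(B**2 for B in block_totals) / k - correction
ss_treatments = 7665.905                    # quoted in the text (needs all seven Qs)
ss_total = 15057.810                        # quoted in the text (needs all 21 observations)

# Error line of the analysis of variance, obtained by difference.
ss_error = ss_total - ss_blocks - ss_treatments
df_error = (N - 1) - (b - 1) - (t - 1)
ms_error = ss_error / df_error

# Standard error of the difference between two corrected treatment means.
var_diff = 2 * k * (t - 1) * ms_error / (N * (k - 1))
se_diff = var_diff ** 0.5

print(round(ss_blocks, 3), df_error, round(ms_error, 3), round(se_diff, 3))
```

The Error mean square found this way agrees with Table 13.2 to within rounding of the quoted sums of squares.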

To form these treatment means, corrected for block differences, we obtain the
t quantities

v1 = (t - 1)/(N(k - 1)) x Q1 = (7 - 1)/(21(3 - 1)) x (-84) = -12.000

These vs represent the deviation of each of the t treatments from the grand mean,
which is G/N = 1454/21 = 69.238. The corrected treatment mean for treatment 1
is thus

v1 + G/N = -12.000 + 69.238 = 57.238

Should we also be interested in the Block means, in the case of designs such
as the present example, in which every pair of blocks has the same number of
treatments in common, we can estimate the corrected block means by calculating
Q's similar to the Qs, e.g.

Q'A = 3 x 170 - 183 - 220 - 163 = -56

and v'A = (b - 1)/(N(r - 1)) x Q'A = (7 - 1)/(21(3 - 1)) x (-56) = -8.000

whence the corrected mean for block A is (69.238 + (-8.000)) = 61.238.

More generally, we obtain the corrected block mean as, here,

(1/3)(170.000 - 57.238 - 79.095 - 57.667) + 69.238 = 61.238
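The two routes to a corrected mean can be followed numerically. The sketch below uses only figures quoted in the text (treatment 1 with total 183 in blocks totalling 170, 182 and 281; block A with total 170 containing treatments totalling 183, 220 and 163). The multiplier used for blocks, (b - 1)/(N(r - 1)), is written by symmetry with the treatment formula and should be read as an assumption that holds for designs like this one, where t = b and k = r.

```python
t, k, r, b = 7, 3, 3, 7
N = t * r
G = 1454
grand_mean = G / N

# Treatment 1: Q, deviation v and corrected mean.
Q1 = k * 183 - (170 + 182 + 281)
v1 = (t - 1) / (N * (k - 1)) * Q1
mean_treatment_1 = grand_mean + v1

# Block A: the analogous quantities, with the roles of blocks and treatments exchanged.
QA = r * 170 - (183 + 220 + 163)
vA = (b - 1) / (N * (r - 1)) * QA
mean_block_A = grand_mean + vA

# The same block mean from the block total and the corrected treatment means it contains.
mean_block_A_alt = (170.000 - 57.238 - 79.095 - 57.667) / k + 69.238

print(Q1, round(mean_treatment_1, 3), QA, round(mean_block_A, 3), round(mean_block_A_alt, 3))
```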

In the particular case discussed, where each pair of blocks has the same
number of treatments in common, the standard error of the difference between
corrected block means is the same as that quoted above for corrected treatment
means. In the more general case, the variance is, on the average,

Vb = (2σk²/k) (1 + (t - 1)(t - k) / (t(k - 1)(b - 1)))

which of course gives identical results in the present instance.


(c)  Possible  Designs 

A  list  of  possible  designs  is  given  in  Fisher  and  Yates'  Tables  (Table  XVII). 
Broadly  speaking,  if  the  number  of  treatments  and  the  size  of  block  are  both 
determined,  then  the  minimum  number  of  replications  of  each  treatment  is  fixed. 
If we are anxious to have only a small number of replications we can often avoid
the straightforward solution. For example, to compare 6 treatments in blocks of 4
we require 10 replications for balance. If we are of the opinion that 10
replications are more than we need, we can instead use the design for 7 treatments,
which requires only 4 replications: the seventh treatment can be a replication of
one of the 6 or an additional one, as we choose.

The most useful designs are that already given (k = 3, b = 7, t = 7, r = 3,
that is to say, 3 units per block, 7 blocks, 7 treatments, 3 replications) and those
in Table 13.3.
Table 13.3

k = 3, b = 4, t = 4, r = 3          k = 4, b = 5, t = 5, r = 4

A   X  X  X                         A   X  X  X  X
B   X  X  X                         B   X  X  X  X
C   X  X  X                         C   X  X  X  X
D   X  X  X                         D   X  X  X  X
                                    E   X  X  X  X

k = 4, b = 7, t = 7, r = 4

A   X  X  X  X
B   X  X  X  X
C   X  X  X  X
D   X  X  X  X
E   X  X  X  X
F   X  X  X  X
G   X  X  X  X

Fisher and Yates' Tables give a number of other interesting solutions. For
example, 9 treatments can be compared in blocks of 3 with 4 replications, or 13
treatments in blocks of 4 with 4 replications, or 16 treatments in blocks of 4 with
5 replications, or 21 treatments in blocks of 5 with 5 replications.
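The remark that fixing the number of treatments and the block size fixes the minimum replication can be illustrated by a short search. The function below (the name min_replications is ours) is a sketch resting on two counting conditions only: the number of blocks tr/k must be a whole number, and each pair of treatments must occur together a whole number of times, r(k - 1)/(t - 1). These conditions are necessary for balance but do not by themselves guarantee that a design exists, so the published tables remain the authority.

```python
def min_replications(t, k, r_max=100):
    """Smallest r satisfying the two counting conditions for a balanced design.

    t: number of treatments, k: units per block.
    Returns None if no r up to r_max qualifies.
    """
    for r in range(1, r_max + 1):
        blocks_whole = (t * r) % k == 0              # b = tr/k is an integer
        pairs_whole = (r * (k - 1)) % (t - 1) == 0   # each pair meets r(k-1)/(t-1) times
        if blocks_whole and pairs_whole:
            return r
    return None


# Cases mentioned in the text, given as (treatments, block size).
for t, k in [(6, 4), (7, 4), (7, 3), (9, 3), (13, 4), (16, 4), (21, 5)]:
    print(t, k, min_replications(t, k))
```

For 6 treatments in blocks of 4 the search returns 10, and for 7 treatments in blocks of 4 it returns 4, as stated in the text.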

(d)  Other  Uses  for  Symmetrical  Incomplete  Blocks 

It might be pointed out that though, as indicated, these designs were introduced
to evade the difficulty of restricted block size, they can be adapted for
other purposes. For example, if we had a two factor experiment, each factor
at 7 levels, complete execution would require 7² = 49 units. We could, however,
obtain satisfactory estimates of the two factors by use of the design discussed,
which involves only 21 runs, and which gives the variance of the difference between
two corrected means as 6σk²/7. The complete experiment would have given it
as 2σk²/7, so our error in the incomplete case is larger, but may still be small
enough for our purposes.

The alternative classical solution of this problem would be to carry out the
treatments T1 to T7 with B constant at B1, and B2 to B7 with T constant at T1,
which would require 13 units or runs. In the absence of replication the experiment
is most unsatisfactory, for there is no estimate of error and no means of
testing for significance. If we replicate all treatments, making a total of 26 runs,
we then get the comparison of our means with a variance σk². Comparing with
the balanced incomplete block arrangement, we have done 24% more work and
got an error variance about 17% larger (σk² against 6σk²/7).
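The comparison of the three plans reduces to a few fractions. The sketch below works in units of σk² and uses only the run counts and variances given in the text; the variable names are illustrative. It shows the replicated classical plan needing 5/21 (about 24%) more runs than the incomplete block plan, while its variance is one sixth larger, or equivalently the incomplete block variance is one seventh (about 14%) smaller.

```python
from fractions import Fraction

# Variance of the difference between two means, in units of sigma_k squared.
t, k, r = 7, 3, 3
N = t * r
var_incomplete = Fraction(2 * k * (t - 1), N * (k - 1))   # balanced incomplete blocks
var_complete = Fraction(2, 7)                             # full 7 x 7 experiment, 7 runs per mean
var_classical = Fraction(2, 2)                            # one factor at a time, every run duplicated

runs_incomplete, runs_complete, runs_classical = 21, 49, 26

extra_work = Fraction(runs_classical, runs_incomplete) - 1
extra_variance = var_classical / var_incomplete - 1
saving_in_variance = 1 - var_incomplete / var_classical

print(var_incomplete, var_complete, extra_work, extra_variance, saving_in_variance)
```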

(e)  Youden  Squares 

As  stated  in  Chapter  I,  Section  (g),  in  those  cases  where  the  number  of  blocks 
equals  the  number  of  treatments  it  is  possible  to  arrange  the  order  of  treatments 
in  each  block  so  that  each  row  contains  all  treatments  once  and  once  only. 

Table 1.7 gave the design for t = b = 5, r = k = 4, using the symbols
defined at the beginning of Chapter XIII (b). Thus the numbers of treatments
(t) and of blocks (b) equalled 5 and the numbers of units per block (k) and of
replications of each treatment (r) equalled 4.

The other randomised blocks of Table 13.3 can be rearranged as in Table 13.4.

Table 13.4

t = b = 4, k = r = 3          t = b = 7, k = r = 4

A B C D                       A B C D E F G
B C D A                       B C E F G A D
C D A B                       G F D A C B E
                              D G B C A E F

Other designs are in Table 13.5.

Table 13.5

t = b = 7, k = r = 3          t = b = 6, k = r = 5

A B C D E F G                 A B C D E F
B F D A C G E                 B C D E F A
C D G E F A B                 C D E F A B
                              D E F A B C
                              E F A B C D

t = b = 11, k = r = 5

A B C D E F G H I J K
B C F K I G E A J D H
C G D I K J H F A B E
D J K G F E I B C H A
E K H A B D C G F I J

Youden (3) gives several useful solutions (Table 13.6).

Table 13.6

t = b = 13, k = r = 4

A B C D E F G H I J K L M
C D E F G H I J K L M A B
D E F G H I J K L M A B C
H I J K L M A B C D E F G

t = b = 21, k = r = 5

A B C D E F G H I J K L M N O P Q R S T U
D E F G H I J K L M N O P Q R S T U A B C
E F G H I J K L M N O P Q R S T U A B C D
J K L M N O P Q R S T U A B C D E F G H I
L M N O P Q R S T U A B C D E F G H I J K
(3)  W.  J.  Youden.  Contributions  from  Boyce  Thompson  Institute,  Vol.  9,  page  41,  1937. 
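The two properties that make an arrangement a Youden Square can be checked mechanically: every row must contain each treatment once, and every pair of treatments must fall together in the same number of columns (blocks). The sketch below is a minimal checker under those two assumptions; the function name youden_lambda is ours, and the squares tested are the 7 x 3 and 6 x 5 designs of Table 13.5 and the 7 x 4 design of Table 13.4.

```python
from collections import Counter
from itertools import combinations


def youden_lambda(rows):
    """Return the number of columns shared by each pair of treatments.

    rows: list of strings, one per row, treatments separated by spaces.
    Raises ValueError if the arrangement is not a Youden Square.
    """
    grid = [row.split() for row in rows]
    treatments = sorted(grid[0])
    for row in grid:
        if sorted(row) != treatments:
            raise ValueError("each row must contain every treatment exactly once")
    together = Counter()
    for column in zip(*grid):                     # each column is one block
        if len(set(column)) != len(column):
            raise ValueError("a treatment is repeated within a block")
        together.update(combinations(sorted(column), 2))
    counts = {together[pair] for pair in combinations(treatments, 2)}
    if len(counts) != 1:
        raise ValueError("pairs of treatments do not meet equally often")
    return counts.pop()


seven_by_three = ["A B C D E F G", "B F D A C G E", "C D G E F A B"]
seven_by_four = ["A B C D E F G", "B C E F G A D", "G F D A C B E", "D G B C A E F"]
six_by_five = ["A B C D E F", "B C D E F A", "C D E F A B", "D E F A B C", "E F A B C D"]

print(youden_lambda(seven_by_three), youden_lambda(seven_by_four), youden_lambda(six_by_five))
```

The value returned should equal r(k - 1)/(t - 1) for the design concerned.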
The computation of the analysis of variance for a Youden Square proceeds
on exactly the same lines as an ordinary balanced incomplete block experiment.
The Sums of Squares and Degrees of Freedom for the Total, Blocks, and Treatments
are calculated as in Section XIII (b) and the only difference is that one
obtains a sum of squares and degrees of freedom for rows before finding by
difference the sum of squares and degrees of freedom for the Residual Error.

The degrees of freedom for rows is obviously one less than the number of
rows, or (k - 1) = (r - 1). The sum of squares is found by summing the
squares of the row totals, dividing this total by the number of individuals in a row
(= t = b), and subtracting from this result the usual correcting factor, the grand
total squared divided by the grand number of individuals.

CHAPTER  XIV 

CONFOUNDING : THE PROBLEM OF RESTRICTED BLOCK SIZE
IN FACTORIAL EXPERIMENTS (1)

(a) The Algebraic Expressions for Factors

In Chapter I we mentioned that for factorial experiments of the type 2^n
(n factors all at 2 levels) the number of runs required, being 2^n, rapidly increased
and for 4, 5 and 6 factors is 16, 32 and 64 respectively. These numbers may be
too large to get into a single block, and the problem then arises of how to split
the experiment up into smaller units.

Let us introduce an appropriate symbolism. Let the small letters p, q, r,
etc., stand for the condition that the factor referred to is at its upper level. For
example, in a four factor experiment rs would mean that P and Q are at their
lower levels and R and S at their upper levels. Let the capital letter P denote the
difference between the sum of all the upper levels of P and the sum of all the
lower levels of P, and similarly for the other factors. Then for a three factor
experiment,

P = pqr + pq + pr + p - qr - q - r - 1

where 1 implies all the factors at their lower levels. Algebraically, this is
equivalent to

P = (p - 1)(q + 1)(r + 1)

and we can show that

Q = (p + 1)(q - 1)(r + 1),

R = (p + 1)(q + 1)(r - 1).
(1) For discussions of confounding see R. A. Fisher: "The Design of Experiments,"
Chapter VI (Oliver and Boyd), and F. Yates: "The Design and Analysis of Factorial
Experiments" (Imperial Bureau of Soil Science).

Much mathematical ingenuity has been devoted to the conception of confounding.
The important papers include:

K. R. Nair (1938). On a method of getting confounded arrangements in the
general symmetrical type of experiment. Sankhya, Vol. 4, 121-138.

K. R. Nair (1940). Balanced confounded arrangements for the 5^n type of
experiment. Sankhya, Vol. 5, 57-70.

R. C. Bose and K. Kishen (1940). On the problem of confounding in the general
symmetrical factorial design. Sankhya, Vol. 5, 21-36.

R. A. Fisher (1942). The theory of confounding in factorial experiments in
relation to the theory of groups. Annals of Eugenics, Vol. 11, 341-353.

R. A. Fisher (1945). A system of confounding for factors with more than two
alternatives, giving completely orthogonal cubes and higher powers. Annals of
Eugenics, Vol. 12, 283-290.

A summary of the methods of forming the layouts is given by D. J. Finney (1947).
The construction of confounding arrangements. Journal of Experimental Agriculture,
Vol. 15, 107-112.

The first order interaction PQ is given by the difference between P at the
lower level of Q, namely,

pr + p - r - 1

and P at the upper level of Q, namely,

pqr + pq - qr - q.

This gives

PQ = pqr + pq - qr - q - pr - p + r + 1

and algebraically this is equivalent to

PQ = (p - 1)(q - 1)(r + 1).

It  will  be  noted  that  we  get  exactly  the  same  result  if  we  regard  PQ  as  the 
difference  between  Q  at  the  lower  level  of  P,  namely, 

qr  +  q  —  r  —  1 
and  Q  at  the  upper  level  of  P,  namely, 

pqr  +  pq  —  pr  —  p  : 
for  this  gives 

PQ  =  pqr  +  pq  —  qr  —  q  —  pr  —  p  +  r  +  1 
=  (p  —  1)  (q  —  1)  (r  +  1). 
Similarly

QR = (p + 1)(q - 1)(r - 1)

RP = (p - 1)(q + 1)(r - 1).

The second order interaction PQR is rather more complicated: we can regard
it as the difference between PQ at the lower level of R and PQ at the upper level
of R.

The former is

(PQ)R1 = pq - q - p + 1

and the latter is

(PQ)R2 = pqr - qr - pr + r

so the difference is

PQR = pqr - qr - pr + r - pq + q + p - 1
    = (p - 1)(q - 1)(r - 1).

As with a first order interaction, so with a second order interaction there are
complementary ways of looking at it; we might have regarded it as the difference
between QR at P1 and QR at P2, or as the difference between RP at Q1 and RP
at Q2.
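The rule behind these products is simple enough to mechanise: a factor named in the effect contributes (letter - 1), so a treatment takes a minus sign for each such letter it lacks, while a factor not named contributes (letter + 1) and never changes the sign. The sketch below applies that rule to three factors; the function names are ours and the treatment with all factors at their lower levels is written "1" as in the text.

```python
from itertools import combinations

FACTORS = "pqr"


def treatments(factors=FACTORS):
    """All treatment combinations, with "1" for every factor at its lower level."""
    return ["".join(c) or "1"
            for n in range(len(factors) + 1)
            for c in combinations(factors, n)]


def sign(effect, treatment):
    """Sign of a treatment in the expansion of an effect such as "PQ"."""
    s = 1
    for letter in effect.lower():
        if letter not in treatment:     # the factor (letter - 1) supplies its -1 term
            s = -s
    return s


def expansion(effect, factors=FACTORS):
    return {tr: sign(effect, tr) for tr in treatments(factors)}


for effect in ("P", "PQ", "PQR"):
    terms = " ".join(("+" if s > 0 else "-") + tr for tr, s in expansion(effect).items())
    print(effect, "=", terms)
```

The signs printed for P, PQ and PQR can be compared term by term with the expansions written out above.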


(b)  Confounding  with  Three  Factors 

Now consider that we have a three factor experiment, which will take 2³ = 8
runs, and we need to put this experiment not into one block of 8 runs but into
2 blocks of 4 runs, the block size admitting only up to 4 runs. If we consider
that we are mainly interested in the main effects and the first order interactions,
and are willing to sacrifice the second order interaction to achieve this end, we
can allocate the treatments to the two blocks as in Table 14.1.

Table 14.1

Block 1     pqr     r       p       q
Block 2     1       qr      pr      pq

The second order interaction PQR is now lost as it is identical with the possible
block differences, for the effect

pqr + r + p + q - 1 - qr - pr - pq

is both PQR and the difference between Block 1 and Block 2. This confusing
of effects with block differences is known as "confounding."




We have now to assure ourselves that our estimates of the main effects and
first order interactions are not affected by possible block differences. Suppose
that Block 2 has a bias such that all the four results contained in it come out d
higher than they would have done if they had been in Block 1.
We have

P = pqr + (pq + d) + (pr + d) + p - (qr + d) - q - r - (1 + d)
  = pqr + pq + pr + p - qr - q - r - 1

as before, the biases d vanishing, and similarly for the two other main effects.
Also

PQ = pqr + (pq + d) - (qr + d) - q - (pr + d) - p + r + (1 + d)
   = pqr + pq - qr - q - pr - p + r + 1

as before, the biases d again vanishing, and similarly for the other two first order
interactions.
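The cancellation of the bias can be confirmed for every effect at once. In the sketch below the yields are hypothetical figures chosen only for illustration, and the bias d = 5 is likewise an assumption; Block 2 is taken from Table 14.1. Each effect is computed with and without the bias and the change recorded.

```python
TREATMENTS = ["1", "p", "q", "r", "pq", "pr", "qr", "pqr"]
BLOCK_2 = {"1", "qr", "pr", "pq"}          # Table 14.1; the other four form Block 1


def sign(effect, treatment):
    """Sign of a treatment in the expansion of an effect such as "PQ"."""
    s = 1
    for letter in effect.lower():
        if letter not in treatment:
            s = -s
    return s


def contrast(effect, yields):
    return sum(sign(effect, tr) * y for tr, y in yields.items())


# Hypothetical yields (not from the text), and a hypothetical block bias d.
base = dict(zip(TREATMENTS, [20, 27, 23, 31, 30, 36, 33, 41]))
d = 5
biased = {tr: y + (d if tr in BLOCK_2 else 0) for tr, y in base.items()}

EFFECTS = ["P", "Q", "R", "PQ", "QR", "PR", "PQR"]
shift = {e: contrast(e, biased) - contrast(e, base) for e in EFFECTS}
print(shift)
```

Only PQR is displaced, and by -4d, since all four treatments of Block 2 enter PQR with a minus sign.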

(c)  Confounding  with  Four  Factors 

With four factors, the total number of runs required is 2⁴ = 16. The main
effects are of the type

P = (p - 1)(q + 1)(r + 1)(s + 1),

the first order interactions of the type

PQ = (p - 1)(q - 1)(r + 1)(s + 1),

the second order interactions of the type

PQR = (p - 1)(q - 1)(r - 1)(s + 1),

and the third order interaction is

PQRS = (p - 1)(q - 1)(r - 1)(s - 1).

If we expand the latter it becomes

PQRS = pqrs - pqr - pqs + pq - prs + pr + ps - p - qrs
       + qr + qs - q + rs - r - s + 1.

If we wish to confound in 2 blocks of 8, and select PQRS as the term to be
confounded, then we merely need to place all those terms with a + sign in one
block and all those with a - sign in the other block, as in Table 14.2.

Table 14.2

Block 1     pqrs, pq, pr, ps, qr, qs, rs, 1
Block 2     pqr, pqs, prs, qrs, p, q, r, s

To  confound  in  4  blocks  of  4,  we  need  to  select  3  terms  to  correspond  to  the 
3  degrees  of  freedom  that  we  confound  with  block  differences.  It  is  found  that 
while  we  can  select  any  two  arbitrarily,  the  third  is  then  uniquely  determined. 
The  third  is  determined  by  the  rule  that  we  multiply  the  first  two  together  by 
the  ordinary  laws  of  algebra,  but  wherever  a  squared  term  appears  we  place  it 
equal  to  unity.  Thus  if  the  two  selected  are  PQ  and  RS,  then  the  third  is  PQRS  : 
if  the  two  selected  are  PQR  and  QRS,  then  the  third  is  PS. 

A  little  experiment  will  show  that  with  4  factors  to  be  confounded  in  4  blocks 
of  4  it  is  impossible  to  avoid  including  one  first  order  interaction  amongst  the 
three  that  are  to  be  confounded. 
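The multiplication rule, and the "little experiment" just mentioned, can both be tried by machine. Multiplying two effects and setting squared letters equal to unity is the same as keeping the letters that occur in exactly one of them. The sketch below (function and variable names are ours) forms the third confounded effect for every pair of effects in four factors and records the choice whose shortest member is longest.

```python
from itertools import combinations

FACTORS = "PQRS"


def product(a, b):
    """Generalised interaction: multiply two effects, dropping squared letters."""
    return "".join(sorted(set(a) ^ set(b)))


# All 15 effects: main effects and interactions of every order.
effects = ["".join(c) for n in range(1, len(FACTORS) + 1)
           for c in combinations(FACTORS, n)]

best = None
for x, y in combinations(effects, 2):
    trio = (x, y, product(x, y))
    shortest = min(len(e) for e in trio)       # 1 = a main effect, 2 = first order
    if best is None or shortest > best[0]:
        best = (shortest, trio)

print(product("PQ", "RS"), product("PQR", "QRS"), best)
```

The search finds no set of three whose members all contain three or more letters, which is the statement in the text that one first order interaction must be given up.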

One  method  of  dividing  the  treatments  into  their  appropriate  blocks  is  to 
expand  two  of  the  selected  interactions  which  are  being  confounded,  as  in  Table  14.3 
Table 14.3

PQR = pqrs - pqs + pqr - pq - qrs + qs - qr + q
      - prs + ps - pr + p + rs - s + r - 1

QRS = pqrs - pqs - pqr + pq + qrs - qs - qr + q
      - prs + ps + pr - p - rs + s + r - 1

Where one treatment has the same plus sign in both expansions, we place it
in one block. Similarly - -, + -, and - + form the other blocks, as in
Table 14.4.

Table 14.4

Block 1     Block 2     Block 3     Block 4
 + +         - -         + -         - +

pqrs        pqs         pqr         pq
q           qr          qs          qrs
ps          prs         p           pr
r           1           rs          s

There is an alternative method of writing down the treatments which in the
more complicated cases is less troublesome. We consider the three interactions
which we have selected for confounding, here

PQR, QRS, PS.

We place in the first block those treatments which have an even number of letters
in common with the letters of each of these three terms. The treatment (1) is
obviously satisfactory, for it has none in common (and zero is an even number). The
treatment qr is also satisfactory, for it has 2 letters in common with both PQR
and QRS and none in common with PS. The two others satisfying the condition
are prs and pqs. This has given us one block, actually equivalent to Block 2 in
Table 14.4. To get the next block, we select any treatment which has not yet
occurred, say p, and multiply each of the terms in our first block by this, using
the convention as before that e.g. p² = 1. Thus we get

pqs x p = p²qs = qs
qr  x p = pqr
prs x p = p²rs = rs
1   x p = p

Accordingly, (p, rs, pqr and qs) form our second block, which actually is Block 3
of Table 14.4. The next block is formed by selecting a treatment which has not
yet occurred, say q, and multiplying each term of the first block in turn. This will
give us Block 1 of Table 14.4. Selecting any treatment not yet mentioned, say s,
and carrying out the same operation gives us Block 4.
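The even-letters rule and the multiplication step translate directly into code. The sketch below is one possible rendering, with function names of our own choosing: principal_block collects the treatments having an even number of letters in common with every confounded effect, and blocks generates the remaining blocks by multiplying the first block by the first treatment not yet used.

```python
from itertools import combinations


def all_treatments(factors):
    return ["".join(c) or "1"
            for n in range(len(factors) + 1)
            for c in combinations(factors, n)]


def multiply(a, b):
    """Product of two treatments with squared letters set equal to unity."""
    letters = set(a.replace("1", "")) ^ set(b.replace("1", ""))
    return "".join(sorted(letters)) or "1"


def principal_block(factors, confounded):
    """Treatments with an even number of letters in common with every effect."""
    block = []
    for tr in all_treatments(factors):
        letters = set(tr) - {"1"}
        if all(len(letters & set(e.lower())) % 2 == 0 for e in confounded):
            block.append(tr)
    return block


def blocks(factors, confounded):
    first = principal_block(factors, confounded)
    result, seen = [first], set(first)
    for tr in all_treatments(factors):
        if tr not in seen:                      # a treatment not yet mentioned
            new = [multiply(tr, x) for x in first]
            result.append(new)
            seen.update(new)
    return result


for block in blocks("pqrs", ["PQR", "QRS", "PS"]):
    print(block)
```

With the four factor example the blocks come out in the order Block 2, Block 3, Block 1, Block 4 of Table 14.4, because the multipliers are taken in the order p, q, s.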


(d)  Confounding  with  Five  Factors 

With five factors we require a total of 2⁵ = 32 runs. Accordingly there are
a number of possibilities according to the size of block we have available.

For splitting into 2 blocks of 16 we obviously choose to confound the highest
order interaction, PQRST. To allocate the treatments into the two blocks, we
can expand (p - 1)(q - 1)(r - 1)(s - 1)(t - 1) and place all treatments with
a + sign in one block and all those with a - sign in the other block.
Alternatively, we get the same result by taking for one block all those treatments with
an even number of letters in common with PQRST: these are (1) (no letter in
common), the ten treatments of the type pq, qr, pr, ps, etc., and the five treatments
of the type pqrs, qrst, etc.

For splitting into 4 blocks of 8, the best set of interactions to be selected for
confounding is of the type

PQR, RST, PQST

which avoids losing any first order interaction. For allocation into the four
blocks, we can expand any two, say PQR and RST, and distribute the treatments
according to whether their signs in the two expansions are + +, + -, - +, or
- -, as in the previous section. Alternatively, we can take as the first block
those treatments with an even number of letters in common with each of the
interactions selected for confounding. These are

(1), pq, st, pqst, qrs, qrt, prs, prt.

The subsequent blocks are obtained by multiplying each of these treatments by
p for the second block, s for the third block, and r for the fourth block.

For splitting into 8 blocks of 4, the best we can do is to lose 2 first order
interactions. A typical set is

PQ, RS, PRT, QRT, PST, QST, PQRS.

One method of building up the treatment combinations for the eight blocks is to
expand any three (the third not being the product of the first two: e.g. if we
select PQ and RS as the first two, then the third cannot be PQRS) and then the
eight blocks are made up according to whether a treatment has the signs in the
three expansions + + +, + + -, + - +, + - -, - + +, - + -, - - +,
or - - -. The alternative method is, however, less arduous. Our first block
would be

(1), pqt, rst, pqrs,

and the other seven are obtained by multiplying by treatments not hitherto
mentioned. For example, to get our next block we can multiply by p, getting

p, qt, prst, qrs.

For the third block we can multiply by q, getting

q, pt, qrst, prs,

and so on, till all treatments have occurred.
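That two first order interactions is the smallest possible loss for 8 blocks of 4 can be confirmed by exhaustive search, since with five factors there are only 31 effects. The sketch below is our own check, not a method from the text: it tries every choice of three generating effects, forms the seven confounded effects by the multiplication rule, discards any set containing a main effect, and counts the two-letter members.

```python
from itertools import combinations

FACTORS = "PQRST"
effects = [frozenset(c) for n in range(1, len(FACTORS) + 1)
           for c in combinations(FACTORS, n)]


def generated(g1, g2, g3):
    """The seven effects confounded by three generators, or None if they are dependent."""
    group = set()
    for mask in range(1, 8):                   # every non-empty selection of generators
        e = frozenset()
        for i, g in enumerate((g1, g2, g3)):
            if (mask >> i) & 1:
                e = e ^ g                      # multiply, dropping squared letters
        group.add(e)
    if frozenset() in group or len(group) < 7:
        return None
    return group


fewest = None
for gens in combinations(effects, 3):
    group = generated(*gens)
    if group is None or any(len(e) == 1 for e in group):
        continue                               # dependent generators, or a main effect lost
    lost = sum(len(e) == 2 for e in group)     # first order interactions confounded
    fewest = lost if fewest is None else min(fewest, lost)

typical = generated(frozenset("PQ"), frozenset("RS"), frozenset("PRT"))
print(fewest, sorted("".join(sorted(e)) for e in typical))
```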

(e)  Confounding  with  Six  Factors 

With six factors we require 2⁶ = 64 runs. In confounding in 2 blocks of 32
we obviously choose the highest order interaction PQRSTU to be confounded.
For four blocks of 16 the most satisfactory set for confounding is

PQRS, RSTU, PQTU,

and for 8 blocks of 8

PQR, RST, PSU, QTU, PQST, QRSU, PRTU.

In all these cases we avoid losing any first order interaction. To get the 64 runs
into 16 blocks of 4, however, losing 3 first order interactions is unavoidable. A
typical set of 15 interactions for confounding is

PQ, RS, PRT, QRT, PST, QST, PQRS, TU, PQTU, RSTU,
PRU, QRU, PSU, QSU, PQRSTU.

For allocating the treatments to their appropriate blocks the same method
as hitherto can be applied. The first block is

(1), pqrs, rstu, pqtu

and the subsequent blocks can be obtained by multiplication.

(f) Computation of the Results of a Confounded Experiment : An Example

In a confounded experiment the computation proceeds exactly as though the
experiment had not been confounded, with the sole difference that we do not
compute those interactions which we have confounded. Instead, we take the
block totals, square them, sum the squares, divide by the number of treatments
occurring in each block, and subtract the usual correcting factor (grand total
squared over grand total number of observations). The degrees of freedom for
this sum of squares is of course one less than the number of blocks. We would
get an identical result if we calculated the sum of squares for the interactions
being confounded, and then pooled them.

As an example, let us consider an experiment on a process employing a
mixed catalyst made up of three components. We wish to vary catalyst component
A at 4 levels and components C and D each at 2 levels, and we measure
the yield as the dependent variable. A straightforward factorial experiment
would require 4 x 2 x 2 = 16 runs. However, it was necessary to confound
the experiment in blocks of 4, 4 blocks thus being required.

The first problem is how to confound with a factor at 4 levels. Yates has
given a method by which it can be regarded as two factors each at 2 levels. If
we regard the two factors as A' and A", with suffixes 1 and 2 to indicate their
lower and upper levels, and if the four levels of the four level factor are a1 to a4,
then Table 14.5 gives the appropriate allocation.


Table 14.5

          A'1     A'2
A"1       a2      a3
A"2       a1      a4

With this allocation

A'   = a4 + a3 - a2 - a1
A"   = a4 - a3 - a2 + a1

and the interaction of A' and A", which we may call A'", is

A'"  = a4 - a3 + a2 - a1

Accordingly, the linear component is 2A' + A'", the quadratic component
is A", and the cubic component is 2A'" - A'. For confounding we assume
that the cubic component is negligible and confound A'", this leaving us estimates
of the linear and quadratic components.

Reference to Section (c) will show that with four factors (all at two levels)
being confounded one first order interaction must be lost: we select A'" as this
(equivalent to A'A"). The other two most suitable interactions for confounding
are clearly A'CD and A"CD.

The  allocation  of  treatments  to  the  four  blocks  proceeds  in  the  usual  manner, 
and  is  set  out  with  the  yields  obtained  in  Table  14.6. 


Table 14.6

Block 1           Block 2           Block 3           Block 4

(1)       24      a'      25        a"      56        c         48
cd        31      a'cd    23        a"cd    41        d         29
a'a"d      6      a"d     24        a'd     27        a'a"cd    23
a'a"c     14      a"c     36        a'c     40        a'a"      25

Total     75              108               164                 125
The symbolism for treatments is such that, for example, a'a" implies both
A' and A" at their upper levels; reference to Table 14.5 shows that A'2A"2
corresponds to a4, i.e. the highest level of the four-level factor. Factors C and D are
of course at their lower levels. The treatment a"cd has C and D at their upper
levels: a" implies A'1A"2 (i.e. A' at its lower level and A" at its upper level), and
Table 14.5 gives this as a1.

The last line in Table 14.6 gives the block totals, and the sum of squares
for Blocks is of course

(1/4)(75² + 108² + 164² + 125²) - 472²/16 = 1028.500

and this sum of squares has 3 degrees of freedom.

The  remainder  of  the  analysis  proceeds  in  the  usual  fashion.  It  is  interesting 
to  note  that  if  we  calculate  the  three  interactions  confounded,  we  get  for  their 
sums  of  squares — 

A'A"  324.000 

A'CD  2.250 

A"CD  702.250 

and  this  total  agrees  with  that  obtained  above. 
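The agreement between the Block sum of squares and the pooled sums of squares of the three confounded interactions can be reproduced from Table 14.6. In the sketch below the letters x and y stand for a' and a" (a relabelling of ours, made only so that each factor is a single character), and the empty string stands for treatment (1).

```python
# Yields from Table 14.6, keyed by the factors at their upper level.
BLOCKS = [
    {"": 24, "cd": 31, "xyd": 6, "xyc": 14},
    {"x": 25, "xcd": 23, "yd": 24, "yc": 36},
    {"y": 56, "ycd": 41, "xd": 27, "xc": 40},
    {"c": 48, "d": 29, "xycd": 23, "xy": 25},
]
yields = {tr: y for block in BLOCKS for tr, y in block.items()}
n = len(yields)
grand_total = sum(yields.values())

# Block totals squared, divided by the block size, less the correcting factor.
ss_blocks = sum(sum(b.values()) ** 2 / len(b) for b in BLOCKS) - grand_total ** 2 / n


def effect_ss(effect):
    """Sum of squares of a two-level effect from its signed total."""
    total = 0
    for tr, y in yields.items():
        sign = 1
        for letter in effect:
            if letter not in tr:
                sign = -sign
        total += sign * y
    return total ** 2 / n


confounded = {e: effect_ss(e) for e in ("xy", "xcd", "ycd")}   # A'A", A'CD, A"CD
print(ss_blocks, confounded, sum(confounded.values()))
```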

If we wish to obtain a check on the correctness of the computation, we can
calculate the highest order interaction, A'A"CD, directly rather than in the usual
manner of obtaining it by difference. We expand (a' - 1)(a" - 1)(c - 1)(d - 1) as

a'a"cd + a'c + a'd + a"c + a"d + a'a" + cd + 1
- a'cd - a"cd - a'a"c - a'a"d - a' - a" - c - d.

Going through the table of results (Table 14.6) we obtain as this total (-12).
The sum of squares for A'A"CD is the square of this result divided by the number
of observations concerned, or (-12)²/16 = 9.000. This checks with the figure
found by difference from the total sum of squares. In any factorial experiment
with factors at 2 levels it is open to us, if we prefer, to calculate all the sums of
squares in this manner. (2)
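Calculating every sum of squares from its signed total is easily done by machine. The sketch below does so for all fifteen effects of the example, again writing x for a' and y for a" (our relabelling) and the empty string for treatment (1). The fifteen sums of squares, the three confounded ones included, should add up to the total sum of squares.

```python
from itertools import combinations

# Yields from Table 14.6, keyed by the factors at their upper level.
yields = {
    "": 24, "cd": 31, "xyd": 6, "xyc": 14,
    "x": 25, "xcd": 23, "yd": 24, "yc": 36,
    "y": 56, "ycd": 41, "xd": 27, "xc": 40,
    "c": 48, "d": 29, "xycd": 23, "xy": 25,
}
n = len(yields)


def signed_total(effect):
    """Total of the yields, each taken with its sign in the expansion of the effect."""
    total = 0
    for tr, y in yields.items():
        sign = 1
        for letter in effect:
            if letter not in tr:
                sign = -sign
        total += sign * y
    return total


effects = ["".join(c) for k in range(1, 5) for c in combinations("xycd", k)]
ss = {e: signed_total(e) ** 2 / n for e in effects}
total_ss = sum(y * y for y in yields.values()) - sum(yields.values()) ** 2 / n

for e in effects:
    print(e, signed_total(e), ss[e])
print("total", total_ss)
```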

The  table  of  analysis  of  variance  is  in  Table  14.7. 


Table 14.7

Source of Variance                   Degrees of     Sums of
                                     Freedom        Squares

Blocks                                    3         1028.500
A   Linear Component A'                   1          702.250
    Quadratic Component A"                1           30.250
C                                         1          100.000
D                                         1          256.000
A'C                                       1            2.250
A'D                                       1           12.250
A"C                                       1           72.250
A"D                                       1            6.250
CD                                        1           36.000
A'A"C                                     1           36.000
A'A"D                                     1           25.000
Residual                                  1            9.000

Total                                    15         2316.000

(2) A systematic method of making these calculations is given by F. Yates, "Design and
Analysis of Factorial Experiments" (Imperial Bureau of Soil Science), pages 15 and 28.
The interpretation of this table is quite straightforward. The Residual,
which of course is A'A"CD, can be pooled with the two second order interactions
A'A"C and A'A"D to give a mean square of 23.333 with 3 degrees of freedom
which we use for error. We then find all the first order interactions to be
non-significant. Of the main effects, D is significant at the 1% level, C non-significant,
the linear component of A is significant at the 0.1% level and the quadratic
component of A, measuring the possible curvature, is non-significant. We can then
proceed to calculate the relevant averages and attach the appropriate standard
errors to them.

It is noteworthy in Table 14.7 that the Mean Square for Blocks, 1028.500/3
= 342.833, is highly significant (0.1%). With confounding the estimate of error
obtained by pooling all the non-significant first order interactions is 18.178 (with
7 degrees of freedom). If confounding had not been used, but instead the 16
treatments allocated at random, then the sum of squares due to difference between
blocks would have appeared in this pooled residual error, which would then have
been 115.575 with 10 degrees of freedom. The gain in accuracy through the use
of confounding, which reduced the error mean square to 18.178, is thus very
great, and without it the experiment would have been almost valueless.

Finally,  it  is  of  interest  to  note  how  such  an  experiment  as  this  could  have 
been  tackled  traditionally.  The  best  that  could  have  been  done  would  be  to  put 
the  four  level  factor  A  into  two  blocks,  C  and  D  being  constant.  The  third  block 
could  have  contained  C  replicated  twice  (two  observations  at  each  of  the  two 
levels),  A  and  D  being  constant,  and  the  fourth  block  could  have  been  used  similarly 
for  D.  Such  an  experiment  would  only  give  half  the  accuracy  for  estimating  A 
and  one  quarter  the  accuracy  for  estimating  C  and  D  as  the  factorial  confounded 
experiment.  Further,  it  would  have  given  no  estimates  at  all  of  the  possible 
existence of interactions. The gain through the use of the factorial confounded
design  is  thus  very  considerable. 

(g)  Confounding  with  Factors  at  Three  Levels 

A particularly useful experiment is the 3 x 3 x 3 or 3³, three factors each
at three levels, as with the use of three levels any departure from linearity may be
observed and with the method of partitioning the various sums of squares discussed
in Chapter XII (c) the significance of such effects can be readily tested.

The utility of such an experiment is increased by the possibility, shown by
Yates(3), of confounding it in three blocks of nine units. R. A. Fisher(4) has
shown the relationship of this arrangement to the 3 x 3 Latin Square.

In  order  to  get,  say,  the  interaction  between  A  and  B  free  of  block  differences 
we  need  the  nine  combinations  of  A  and  B  in  each  block.  This  can  be  easily 
arranged.  If  we  further  arrange  that  the  level  of  the  third  factor  C  is  determined 
by  a  3  X  3  Latin  Square,  and  subsequent  blocks  are  generated  by  cyclically 
permuting  the  level  of  C,  we  will  after  two  such  permutations  have  a  total  of 
three  blocks  between  them  containing  all  twenty-seven  combinations  of  the  three 
factors.     Thus  suppose  the  first  block  is  given  by  Table  14.8. 


Table 14.8

        A0    A1    A2
B0      C0    C1    C2
B1      C1    C2    C0
B2      C2    C0    C1

(3) Yates, ibid, Section 10.

(4) Fisher, ibid, Section 46.

Dispensing with the letters and considering only the suffices, the treatments are
000, 101, 202, 011, 112, etc. Then if for the second block we substitute C_1 for
C_0, C_2 for C_1 and C_0 for C_2 we get Table 14.9 and a further substitution gives
Table 14.10.


Table 14.9

         A_0    A_1    A_2
B_0      C_1    C_2    C_0
B_1      C_2    C_0    C_1
B_2      C_0    C_1    C_2

Table 14.10

         A_0    A_1    A_2
B_0      C_2    C_0    C_1
B_1      C_0    C_1    C_2
B_2      C_1    C_2    C_0

Each  of  these  blocks  can  be  re-written  with  B  and  C  as  the  axes  and  A  as 
the  treatment  in  the  body  of  the  table,  or  with  A  and  C  as  the  axes  and  B  as  the 
treatment  in  the  body  of  the  table.  Thus  all  three  first  order  interactions  are 
conserved  free  of  block  differences,  as  of  course  are  also  the  three  main  effects. 
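
The construction of Tables 14.8 to 14.10 can be sketched in a few lines of Python. This is only an illustration: it assumes the levels are coded 0, 1, 2, so that the Latin Square of Table 14.8 becomes the rule c = (a + b) mod 3, and the function names are hypothetical.

```python
from itertools import product

LEVELS = range(3)

def y_solution_blocks():
    """Return three blocks of (a, b, c) level triples.

    Block k holds the combinations with c = (a + b + k) mod 3, i.e. the
    Latin Square of Table 14.8 with the level of C cyclically permuted k times.
    """
    return [
        [(a, b, (a + b + k) % 3) for b in LEVELS for a in LEVELS]
        for k in LEVELS
    ]

def pairs_complete(block, i, j):
    """True if the block contains all nine level pairs of factors i and j."""
    return {(t[i], t[j]) for t in block} == set(product(LEVELS, repeat=2))

blocks = y_solution_blocks()
for k, block in enumerate(blocks, start=1):
    print("Block", k, " ".join("%d%d%d" % t for t in block))

# Every pair of factors is complete within every block, so the main effects
# and first order interactions are free of block differences.
for block in blocks:
    for i, j in ((0, 1), (0, 2), (1, 2)):
        assert pairs_complete(block, i, j)
```

The first block printed begins 000, 101, 202, 011, 112, in agreement with the treatments listed above.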

It will be found that there are four possible such solutions. The one above
is what Yates calls his Y set and can be derived from Fisher's type I Latin Square.

The  other  three  are  obtainable  by  permuting  C  as  above  in  Table  14.11. 


Table 14.11

Z = II
         A_0    A_1    A_2
B_0      C_0    C_2    C_1
B_1      C_2    C_1    C_0
B_2      C_1    C_0    C_2

W = III
         A_0    A_1    A_2
B_0      C_0    C_1    C_2
B_1      C_2    C_0    C_1
B_2      C_1    C_2    C_0

X = IV
         A_0    A_1    A_2
B_0      C_0    C_2    C_1
B_1      C_1    C_0    C_2
B_2      C_2    C_1    C_0

In any single 3^3 experiment we can choose any solution, and the two degrees
of freedom confounded between the three blocks come from the second order
interaction which we are using as error. If the experiment is being replicated
more than once, it would be preferable to use a different solution each time as
this will allow a detailed analysis of the second order interaction. In particular,
if we wanted to test the existence of A_L B_L C_L we could do so. The partial
confounding is tricky to compute; the reader who requires it will find the method
in Yates, Section 10 (b).

In parenthesis it may be remarked that partial confounding is applicable to
any experiment. For example, in a 2^3 experiment with four replications, confounded
in blocks of four, the second order interaction and the three first order
interactions can each be confounded once(5).

In the case of a single replication the calculation of the analysis of variance
is quite straightforward. Table 14.12 shows the results of an experiment on a
particular physical property of cordite manufactured under various conditions.
Factors A and B were the percentages of two ingredients, both of which were arranged
to increase regularly on a linear scale so as to allow a ready testing of possible
curvatures in the response by the method of partitioning the sums of squares
on the lines discussed in Chapter XII (i). C was the temperature at a particular
part of the process, also on a scale with regular increments. The experiment
was confounded in three blocks of nine using the Y solution given above in full,
the blocks corresponding to batches of raw material.

(5) Yates, ibid, Sections 4 and 5; Goulden, "Methods of Statistical Analysis"
(John Wiley, 1939), p. 162.


Table 14.12

          Block 1        Block 2        Block 3
          000   131      001   125      002   147
          101    98      102   109      100    97
          202    54      200    39      201    35
          011   110      012   130      010   122
          112    91      110    83      111    72
          210    26      211    20      212    36
          022   130      020   114      021   106
          120    74      121    68      122    84
          221    17      222    32      220    20
Total           731            720            719

The analysis of variance proceeds in the usual manner. The three two-way
tables are formed for A × B, A × C, and B × C. Using the methods of Chapter
XII (c) on the marginal totals the three main effects can be partitioned into their
linear and quadratic components, and operating with the coefficients of Table 12.6
on the figures in these two-way tables the three interactions can be similarly
decomposed. The main effects each have two degrees of freedom and the interactions
each four degrees of freedom. The blocks have two degrees of freedom,
so the Residual has 27 - 1 - 3 × 2 - 3 × 4 - 2 = 6 degrees of freedom and
its sum of squares can be obtained by difference. The results are in Table 14.13.

The  analysis  is  most  conveniently  checked  by  computing  the  sums  of  squares 
for  the  unpartitioned  main  effects  and  interactions.  For  each  term  the  sum  of 
squares  should  equal  the  sum  of  squares  of  the  components.  These  checks  are 
good  as  they  are  arrived  at  by  an  independent  route.  The  only  remaining  check 
required  is  the  total  sum  of  squares  and  the  derivation  of  the  residual  by  difference 
and  these  need  doing  carefully. 
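
The arithmetic described above can be sketched in Python as a check on hand computation. The sketch assumes the linear and quadratic coefficients are (-1, 0, 1) and (1, -2, 1), the usual values for three equally spaced levels (Table 12.6 itself is not reproduced in this section), and that a treatment belongs to block k + 1 when (c - a - b) mod 3 = k, which matches the layout of Table 14.12. All variable names are illustrative.

```python
from itertools import product

# Observations of Table 14.12 as "abc value" pairs.
ROWS = """
000 131  001 125  002 147   101 98   102 109  100 97
202 54   200 39   201 35    011 110  012 130  010 122
112 91   110 83   111 72    210 26   211 20   212 36
022 130  020 114  021 106   120 74   121 68   122 84
221 17   222 32   220 20
"""
tokens = ROWS.split()
y = {tuple(int(ch) for ch in code): float(val)
     for code, val in zip(tokens[0::2], tokens[1::2])}

ONES, LIN, QUAD = (1, 1, 1), (-1, 0, 1), (1, -2, 1)
parts = {"L": LIN, "Q": QUAD}

def contrast_ss(ca, cb, cc):
    """Sum of squares (one degree of freedom) for a product contrast."""
    coef = {(a, b, c): ca[a] * cb[b] * cc[c] for a, b, c in y}
    value = sum(coef[t] * y[t] for t in y)
    return value ** 2 / sum(k * k for k in coef.values())

ss = {}
factors = (("A", 0), ("B", 1), ("C", 2))
for name, pos in factors:                      # main effect components
    for tag, vec in parts.items():
        coefs = [ONES, ONES, ONES]
        coefs[pos] = vec
        ss[name + tag] = contrast_ss(*coefs)
for (n1, p1), (n2, p2) in ((factors[0], factors[1]),
                           (factors[0], factors[2]),
                           (factors[1], factors[2])):
    for (t1, v1), (t2, v2) in product(parts.items(), repeat=2):
        coefs = [ONES, ONES, ONES]
        coefs[p1], coefs[p2] = v1, v2
        ss[n1 + t1 + n2 + t2] = contrast_ss(*coefs)

correction = sum(y.values()) ** 2 / len(y)
total_ss = sum(v * v for v in y.values()) - correction

block_totals = [0.0, 0.0, 0.0]
for (a, b, c), v in y.items():
    block_totals[(c - a - b) % 3] += v
blocks_ss = sum(t * t for t in block_totals) / 9 - correction

# Residual by difference, with 6 degrees of freedom.
residual_ss = total_ss - sum(ss.values()) - blocks_ss
residual_ms = residual_ss / 6

for name, value in ss.items():
    print("%-5s %12.3f" % (name, value))
print("Blocks %.3f  Residual %.3f  Residual mean square %.3f"
      % (blocks_ss, residual_ss, residual_ms))
```

The printed figures can be compared line by line with Table 14.13; small differences in the last decimal place arise only from rounding.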

To  test  the  significance  of  any  item  in  Table  14.13  we  have  the  residual 
variance  equal  to  3.260.  For  any  mean  square  to  give  a  variance  ratio  greater 
than  6.0,  the  5%  point  for  degrees  of  freedom  1  and  6,  it  must  exceed  6.0  X  3.260 
=  19.56,  while  for  the  1%  significance  level  mean  squares  must  exceed  13.7  X  3.26 
=  44.66. 

Inspection of Table 14.13 shows that two interaction terms, B_Q C_L and
A_L C_Q, are significant at the 5% level and one, A_Q B_L, is significant at the 1%
level. The main effects are all enormously significant: in particular the quadratic
components measuring the curvature are all significant.

Table 14.13

Source of     Degrees of     Sums of        Mean
Variance      Freedom        Squares        Squares
A_L               1         38827.555     38827.555
A_Q               1           462.296       462.296
B_L               1          2005.555      2005.555
B_Q               1           185.185       185.185
C_L               1           636.055       636.055
C_Q               1           872.018       872.018
A_L B_L           1             3.000         3.000
A_L B_Q           1             0.444         0.444
A_Q B_L           1            53.778        53.778  **
A_Q B_Q           1             1.815         1.815
A_L C_L           1             0.750         0.750
A_L C_Q           1            23.361        23.361  *
A_Q C_L           1             8.028         8.028
A_Q C_Q           1             8.898         8.898
B_L C_L           1             2.083         2.083
B_L C_Q           1             3.361         3.361
B_Q C_L           1            23.361        23.361  *
B_Q C_Q           1            11.346        11.346
Blocks            2             9.851         4.925
Residual          6            19.558         3.260
Total            26         43158.296

The interaction most worth while inspecting is the A_Q B_L term. Table 14.14
is formed by averaging over C. If A_Q B_L is significant, we expect to find the
curvature of A changing linearly with B. For measuring the curvature it is convenient
to use the same expression as is used for partitioning the sums of squares,
(2A_1 - A_0 - A_2), and this is given for each level of B in the last line of the table.

Table 14.14

                        B_0        B_1        B_2
A_0                   134.32     120.65     116.65
A_1                   101.32      81.99      75.33
A_2                    42.66      27.33      23.00
2A_1 - A_0 - A_2       25.66      16.00      11.01

It  is  apparent  that  the  curvature  of  A  does  change  approximately  linearly 
with  B.  That  the  change  does  not  differ  significantly  from  a  linear  one  is  shown 
by  the  non-significance  of  AqBq  in  Table  14.13. 

However, from the immediate practical viewpoint of finding the condition
to give a minimum value for our dependent variable, this interaction does not
affect the picture, and a study of the main effects indicates the choice of A_2 B_2 C_1.

It will be noted, incidentally, from Table 14.13 that the Block Mean Square
is not significant and is of almost the same magnitude as the Residual. We have
therefore gained little or nothing from the use of the confounding. This, however,
is an accidental result, as it was known that in general the batches of raw materials
which made up the blocks do tend to differ appreciably.

Yates(3) has shown that the 3^4 experiment may also be confounded in blocks
of 9 confounding only second order interactions. He also discusses experiments
with mixed numbers of levels of the type 3 × 2 × 2, etc.

(h)  Confounding  with  Factors  at  Four  Levels. 

The example in Section (f) has discussed the confounding of the 4 × 2 × 2
experiment in four blocks of four. The four level factor being made up of A'
and A", with their interaction A'A" corresponding to the main ingredient of the
cubic component of curvature, and the two level factors being C and D, the best
set of interactions for confounding in 4 blocks of 4 were
A'A", A'CD, A"CD

In the less stringent case of confounding in two blocks of 8, only one interaction
need be confounded, so we would obviously choose A'A"CD. The division
into the two blocks follows from the expansion of (a' - 1)(a" - 1)(c - 1)(d - 1)
and the placing of all treatments with a + sign in one block and all treatments
with a - sign in the other block. Using Table 14.5 to convert treatments from
the a'a" form to the a_4 form, we obtain Table 14.15.

Table 14.15

Block 1    a_4cd + a_4 + a_3c + a_3d + a_1c + a_1d + a_2cd + a_2
Block 2    a_3 + a_1 + a_2c + a_2d + a_4c + a_4d + a_3cd + a_1cd

The 4 × 4 experiment cannot be so easily confounded as the 4 × 2 × 2. It
goes into two blocks of 8 satisfactorily by confounding A'A"B'B". The expansion
of (a' - 1)(a" - 1)(b' - 1)(b" - 1) and conversion into terms of a_1, etc.,
with Table 14.5 gives us Table 14.16.

Table 14.16

Block 1    a_1b_1 + a_1b_3 + a_3b_1 + a_3b_3 + a_2b_2 + a_2b_4 + a_4b_2 + a_4b_4
Block 2    a_1b_4 + a_4b_1 + a_1b_2 + a_2b_1 + a_2b_3 + a_3b_2 + a_3b_4 + a_4b_3

To confound the 4 × 4 experiment in 4 blocks of 4 is not easy, as might be
expected. Yates(6) suggests the use of partial confounding, the three sets
confounding the interactions

(a)  A'B',  A"B",  A'"B'" 

(b)  A'B",  A"B'",  A"'B' 

(c)  A'B'",  A"B',  A"'B" 

where A'" = A'A" and B'" = B'B". These confound once each of the nine
degrees of freedom for the interaction. The use of three replications may be
more than is considered desirable, but the use of less will in each case involve
losing some terms which might be of interest. If only one replication is being
made, then either of the last two sets would be the best to choose. The arrangement
of the treatment combinations follows from the expansion of (a' + 1)
(a" - 1)(b' - 1)(b" + 1) and (a' - 1)(a" - 1)(b' + 1)(b" - 1) and grouping
together the four terms with plus signs in both the expansions, those with both
negative, and so on.

(6) Yates, ibid, Section 14 b.

If a 4 × 4 × 4 experiment has to be confounded in four blocks of 16, the
simplest procedure is to use the same technique discussed in Section (g) for
confounding the 3^3 experiment. In this case we can write down a 4 × 4 Latin
Square, where the rows and columns represent the levels of two of the factors
and the treatments in the body of the square represent the level of the third factor.
Such a Latin Square will give the combination for one of the blocks, and cyclically
permuting the treatments will give the other three blocks. Such a layout will
leave the main effects and first order interactions completely unconfounded, and
the three degrees of freedom taken for blocks will come from the 27 of the second
order interaction. For the residual, therefore, we have 27 - 3 = 24 degrees of
freedom, and its sum of squares can be obtained by difference.

(i)  Double  Confounding (7) 

Sometimes in a factorial experiment we have a double restriction imposed
on the system. For example, in a 2^5 experiment (5 factors all at 2 levels, requiring
a total of 32 runs) we may have 4 reactors which may be different. Accordingly
we confound in 4 blocks of 8, there being 8 runs on each of the 4 reactors. However,
our batches of raw material may not be large enough to carry out the whole
experiment. We then need to confound the experiment in an additional way.
The problem is soluble if we make it the inverse of the first type of confounding.
Thus, if the first confounding is in 4 blocks of 8 then the second must be in 8 blocks
of 4. There is the further restriction that no interaction confounded in the
second set must occur in the first set, and vice versa.

For  the  confounding  in  8  blocks  of  4  let  us  select  for  confounding 
PQ,  RS,  PRT,  QST,  QRT,  PST,  PQRS, 
and  for  the  confounding  in  4  blocks  of  8  we  can  choose 

PQR,  RST,  PQST, 
none  of  the  second  set  occurring  in  the  first. 

To allocate the treatments, we can first set them out in the usual manner for
confounding in 8 blocks of 4, as in Table 14.17.

Table 14.17

Batch 1   Batch 2   Batch 3   Batch 4   Batch 5   Batch 6   Batch 7   Batch 8
(1)       p         q         r         s         t         pr        qr
pqrs      qrs       prs       pqs       pqr       pqrst     qs        ps
rst       prst      qrst      st        rt        rs        pst       qst
pqt       qt        pt        pqrt      pqst      pq        qrt       prt

We  do  likewise  for  the  4  blocks  of  8,  as  in  Table  14.18. 

Table 14.18

Reactor 1   (1)    pq     st     pqst    qrt     prt     prs    qrs
Reactor 2   p      q      pst    qst     pqrt    rt      rs     pqrs
Reactor 3   s      pqs    t      pqt     qrst    prst    pr     qr
Reactor 4   r      pqr    rst    pqrst   qt      pt      ps     qs

(7) c.f. R. A. Fisher, "The Design of Experiments," 3rd edition, p. 116.

We now superimpose Tables 14.17 and 14.18 on top of each other. Treatment
(1) belongs to Batch 1 and Reactor 1, and is entered in Table 14.19 accordingly.
Treatment pqrs belongs to Batch 1 and Reactor 2, and is entered thus. Treatment
rst belongs to Batch 1 and Reactor 4, and so on.


Table 14.19

            Batch 1   Batch 2   Batch 3   Batch 4   Batch 5   Batch 6   Batch 7   Batch 8
Reactor 1   (1)       qrs       prs       st        pqst      pq        qrt       prt
Reactor 2   pqrs      p         q         pqrt      rt        rs        pst       qst
Reactor 3   pqt       prst      qrst      pqs       s         t         pr        qr
Reactor 4   rst       qt        pt        r         pqr       pqrst     qs        ps

Table  14.19  will  be  found  to  satisfy  the  condition  of  both  Tables  14.17  and 
14.18,  and  accordingly  is  the  correct  allocation  of  treatments.  The  analysis  of 
variance  proceeds  normally,  all  the  interactions  being  confounded  not  being 
mentioned  specifically  in  the  table  of  analysis  of  variance  but  their  Sums  of  Squares 
and  Degrees  of  Freedom  being  lumped  together  under  the  description  "Blocks." 
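
The superimposition can also be carried out mechanically. The sketch below is one possible way of doing it, not the procedure used in the text: it assumes that two treatments fall in the same batch when they share the same odd or even number of letters with each of PQ, RS and PRT (which generate the first confounded set), and in the same reactor when the same holds for PQR and RST. The batch and reactor numbers are then taken from the leading entries of Tables 14.17 and 14.18. All names are illustrative.

```python
from itertools import combinations

FACTORS = "pqrst"
BATCH_GENERATORS = ("pq", "rs", "prt")    # generate PQ, RS, PRT, QST, QRT, PST, PQRS
REACTOR_GENERATORS = ("pqr", "rst")       # generate PQR, RST, PQST
BATCH_REPS = ("", "p", "q", "r", "s", "t", "pr", "qr")   # first row of Table 14.17
REACTOR_REPS = ("", "p", "s", "r")                       # first column of Table 14.18

def signature(treatment, generators):
    """Parity of the number of letters shared with each generating interaction."""
    return tuple(len(set(treatment) & set(g)) % 2 for g in generators)

batch_of = {signature(rep, BATCH_GENERATORS): i + 1
            for i, rep in enumerate(BATCH_REPS)}
reactor_of = {signature(rep, REACTOR_GENERATORS): i + 1
              for i, rep in enumerate(REACTOR_REPS)}

# All 32 treatment combinations, with "" standing for treatment (1).
treatments = ["".join(c) for r in range(6) for c in combinations(FACTORS, r)]

table = {}
for t in treatments:
    cell = (reactor_of[signature(t, REACTOR_GENERATORS)],
            batch_of[signature(t, BATCH_GENERATORS)])
    assert cell not in table          # each reactor-batch cell is used once only
    table[cell] = t or "(1)"

for reactor in range(1, 5):
    print("Reactor", reactor, *(table[(reactor, b)] for b in range(1, 9)))
```

The four lines printed should agree with the rows of Table 14.19, and the internal assertion confirms that every combination of reactor and batch receives exactly one treatment.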

It  will  be  noted  that,  if  we  were  confounding  in  8  blocks  of  4,  the  imposition 
of  the  second  restriction  leads  to  no  increase  in  the  number  of  first  order  interactions 
being  lost. 

Double confounding is not very practical in the 2^4 experiment, but works
well in the 2^5 as described above, and also in the 2^6, where there are two
possibilities, either confounding in 8 blocks of 8 and 8 blocks of 8 or in 16 blocks of 4
and 4 blocks of 16.

For the former case, one set of 7 interactions for confounding is
PQRS, RSTU, PQTU, PRT, QST, PSU and QRU
and another non-overlapping set is

PQRU, PRST, QSTU, PQS, RSU, QRT and PTU.

Thus  the  double  restriction  can  be  applied  without  losing  any  first  order 
interactions. 

For double confounding in 16 blocks of 4 and in 4 blocks of 16, one set of
interactions for confounding is

PQ, RS, PRT, QST, QRT, PST, PQRS, TU,
PQTU, RSTU, PRU, QSU, QRU, PSU, PQRSTU,
and the other set, non-overlapping, is
PQRU, PRST, QSTU.

Accordingly,  if  we  are  confounding  in  16  Blocks  of  4,  we  lose  3  first  order 
interactions,  and  the  introduction  of  the  second  restriction  in  no  way  increases 
the  loss,  so  can  always  be  carried  out  when  convenient  or  necessary. 


CHAPTER  XV 


GENERAL  CONCLUSIONS 


(a)  Investigation  of  Multi-Variable  Processes 

The  reader  will  have  noticed  that  we  have  discussed  two  different  techniques 
for  the  investigation  of  multi-variable  systems,  namely  multiple  correlation  and 
the  factorial  experiment. 

Multiple correlation is appropriate where we have a considerable mass of
data on the system under investigation, the various independent variables being
at random and at all levels. Such data arises from the normal running of a plant:
in particular, on a modern plant with automatic recorders kept in proper working
condition such data should be very easily collected.

The  factorial  experiment  is  appropriate  where  we  can  make  a  deliberate 
experiment  upon  a  system,  controlling  the  independent  variables  at  the  levels 
laid  down  by  the  factorial  design.  The  execution  of  such  an  experiment  generally 
calls  for  careful  supervision  and  hence  tends  to  be  relatively  costly. 

The  multiple  correlation  approach  suffers  from  the  disadvantage  that  initially 
one  is  restricted  to  the  assumption  that  the  dependent  variable  is  a  linear  function 
of  the  independent  variables  which  are  also  assumed  to  be  acting  independently. 

It  is,  however,  possible  to  evade  some  of  these  difficulties  arising  from  non- 
linearity  by  use  of  the  various  forms  of  transformation  mentioned  in  Chapter 
IX  (b)  and  Chapter  X  (b).  It  will  be  noted  that  the  satisfactory  use  of  these 
devices  is  based  on  our  initially  selecting  the  appropriate  transformation,  either 
from  physico-chemical  knowledge  of  the  system,  or  from  an  intelligent  or  lucky 
inspection  of  the  data.  If  we  have  to  try  first  one  transformation,  then  another 
and  so  on  till  we  get  a  good  fit  the  amount  of  work  is  excessive. 

The difficulty with regard to the independence of the independent variables
can be partially circumvented by the addition of product terms as additional
independent variables, e.g. if our dependent variable is y and we have independent
variables x_1 and x_2 we can include the third variable (x_1 x_2), i.e. we fit an equation
of the form

y = b_1 x_1 + b_2 x_2 + b_12 (x_1 x_2)

However, if there are three independent variables and we wish to check on the
possible interactions of all of them, the equation we fit is

y = b_1 x_1 + b_2 x_2 + b_3 x_3 + b_12 (x_1 x_2) + b_23 (x_2 x_3) + b_31 (x_3 x_1)

The labour of calculating all the relevant sums of squares and sums of products
and then solving the six simultaneous equations would be very heavy and only
in very important cases could it be justified.
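
With a machine the six simultaneous equations are no longer a serious labour. The following Python sketch forms the sums of squares and products for the six terms and solves the resulting equations by elimination. The data are entirely hypothetical (generated from assumed coefficients without error, purely so that the answer is known), and the function names are illustrative only.

```python
from itertools import product

def design_row(x1, x2, x3):
    """The three variables followed by the three product terms."""
    return [x1, x2, x3, x1 * x2, x2 * x3, x3 * x1]

def solve_linear(matrix, rhs):
    """Solve a small square system by elimination with partial pivoting."""
    n = len(rhs)
    aug = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for k in range(col, n + 1):
                aug[r][k] -= factor * aug[col][k]
    solution = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(aug[i][k] * solution[k] for k in range(i + 1, n))
        solution[i] = (aug[i][n] - tail) / aug[i][i]
    return solution

def fit_with_products(xs, ys):
    """Least squares estimates of b_1, b_2, b_3, b_12, b_23, b_31."""
    design = [design_row(*x) for x in xs]
    p = len(design[0])
    # Sums of squares and products of the six terms (the normal equations).
    normal = [[sum(d[i] * d[j] for d in design) for j in range(p)]
              for i in range(p)]
    rhs = [sum(d[i] * yi for d, yi in zip(design, ys)) for i in range(p)]
    return solve_linear(normal, rhs)

# Hypothetical error-free data from assumed coefficients.
assumed_b = [2.0, -1.0, 0.5, 0.3, 0.0, -0.7]
xs = list(product((-1.0, 0.0, 1.0, 2.0), repeat=3))
ys = [sum(b * term for b, term in zip(assumed_b, design_row(*x))) for x in xs]

estimates = fit_with_products(xs, ys)
print([round(b, 6) for b in estimates])
```

As in the equations above, no constant term is fitted, which corresponds to measuring all the variables from their means; with real plant data the variables would first be adjusted in that way.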

With  the  factorial  experiment,  our  initial  assumptions  are  very  general  and 
we  are  quite  prepared  to  see  any  of  the  first  order  interactions  significant,  and 
we  have  made  no  assumptions  as  to  linearity.  This  generality  means  that  a  factorial 
experiment  could  only  give  confusing  results  if  every  factor  was  interacting  strongly 
with  all  the  others,  a  very  rare  occurrence,  and  even  then  one  which  could  probably 
be  made  clearer  by  a  logarithmic  transformation.  Further,  if  some  of  our  factors 
are  at  more  than  two  levels  we  can  readily  check  on  the  possible  non-linearity  of 
the  effects  by  the  methods  of  Chapter  XII  (c).  The  total  computation  required 
is  very  much  less  than  with  multiple  correlation. 

The  factorial  experiment  has  the  further  advantage  that  it  can  generally  be 
increased  very  appreciably  in  accuracy  by  the  device  of  confounding. 

Taking an overall view, there can be no doubt that the factorial experiment
is to be greatly preferred to an examination of existing data by multiple correlation.
In addition to all the advantages set out above, the experiment has the further
advantage that it is not liable to error through the presence of unsuspected
correlated variables: this is the distinction between experiment and observation
discussed in Chapter IX (k).

In the intermediate case, where some variables can be controlled at the levels
laid down by a factorial design but where some of them are more difficult to
control, the most appropriate technique would be covariance: there are no
published references referring to industrial problems but the technique is described
in the references below.(1)

(1) For a description of the method of covariance see R. A. Fisher: "Statistical Methods
for Research Workers," Section 49.1 (Oliver and Boyd). K. Mather: "Statistical Analysis
in Biology," Section 34 (Methuen). G. W. Snedecor: "Statistical Methods," Chapter 12
(Iowa State College Press).

(b) The Advantages of Planning Experiments

It will have been apparent that none of these statistical designs for experiments
can be applied without the appropriate forethought and planning. They
cannot be superimposed on a hotch-potch of random results.

The implications of this are rather far-reaching. For example, the traditional
style  of  experimenting  by  trying  first  one  thing,  then  if  that  does  not  work  then 
something  else,  and  so  on,  in  fact  the  policy  of  proceeding  by  small  steps,  is  quite 
unsuitable.  On  the  contrary,  we  require  to  sit  down  and  appraise  all  the  points 
that  appear  to  be  of  interest,  or  likely  to  be  of  interest  in  the  future,  and  then 
experiment  on  them  all  simultaneously.  We  thus  move  in  jumps  rather  than  in 
steps.  The  great  advantage  of  moving  in  jumps  is  that  the  information  given 
by  a  jump  adds  up  to  very  much  more  than  that  given  by  its  component  steps, 
though  requiring  no  more  effort. 

To take an example of this, consider the case discussed in Chapter XIII (a)
where we tested 7 treatments in blocks of 3. The variance of the comparison of
any two corrected means was (6/7) σ_k^2 = 0.857 σ_k^2, where σ_k^2 was the error
variance of a single observation within blocks.

Now suppose that instead of starting off the experiment with the full number
of treatments (7) we had begun with only 4 and had afterwards decided to test
the remaining 3. Our first 4 could be tested with the design given in Table 13.3,
when the variance of the comparison of any two of these means would be

2 × [3 × (4 - 1)] / [3 × 4 × (3 - 1)] × σ_k^2 = 0.75 σ_k^2

This compares favourably with the figure of 0.857 σ_k^2 for the whole experiment.
However, when we come to the 3 additional treatments, it is impossible to execute
them in such a manner that when added to the existing data we get the same
design as that used for testing 7 treatments. Accordingly the best we can do is
to select one treatment as a "standard" (say treatment T_4) to occur in both
experiments, and with this standard and the remaining 3 treatments we carry out a
further balanced incomplete block of 4 treatments in blocks of 3, requiring 3
replications. The error variance of the comparison of any two of the four
treatments T_4 to T_7 is 0.75 σ_k^2 as it was for any two of the four treatments T_1 to T_4.
However, when we wish to compare any of the three treatments T_1 to T_3 with
any of the three T_5 to T_7, this can only be done via T_4, and the error variance
of these comparisons is 2 × 0.75 σ_k^2 = 1.5 σ_k^2, nearly double that of 0.857 σ_k^2 for
the experiment in one stage. The overall average variance of a comparison in
the two stage experiment is actually 1.071 σ_k^2.

When  we  consider  that  the  experiment  in  two  stages  has  taken  24  runs  as 
compared  with  21  for  the  experiment  in  one  stage,  it  is  evident  that  the  latter 
has  been  much  more  satisfactory. 
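
The figures quoted in this comparison can be checked with a short calculation. The sketch below assumes the standard result for a balanced incomplete block design, namely that the variance of the difference of two corrected means is 2k/(λt) times the within-block error variance, where t is the number of treatments, k the block size, r the number of replications and λ = r(k - 1)/(t - 1); these symbols are introduced here for illustration and are not the notation of Chapter XIII.

```python
from fractions import Fraction

def bib_pair_variance(t, k, r):
    """Variance of the difference of two corrected treatment means,
    in units of the within-block error variance."""
    lam = Fraction(r * (k - 1), t - 1)   # times each pair of treatments meets in a block
    return Fraction(2 * k) / (lam * t)

one_stage = bib_pair_variance(t=7, k=3, r=3)   # 7 treatments in blocks of 3
stage = bib_pair_variance(t=4, k=3, r=3)       # 4 treatments in blocks of 3

# Two-stage experiment: 6 pairs within T1..T4 and 6 within T4..T7 are compared
# directly; the 9 pairs between T1..T3 and T5..T7 go via T4 at twice the variance.
two_stage_mean = (12 * stage + 9 * 2 * stage) / 21

runs_one_stage = 7 * 3        # 7 blocks of 3
runs_two_stage = 2 * 4 * 3    # two designs, each of 4 blocks of 3

print(float(one_stage), float(stage), float(two_stage_mean))
print(runs_one_stage, runs_two_stage)
```

The three variances come out as 6/7, 3/4 and 15/14 of the error variance, that is 0.857, 0.75 and 1.071, for 21 and 24 runs respectively.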

In parenthesis, a comparison might be made with the classical type of
experiment using a single standard throughout. The nearest approach to a balanced
design calls for 6 blocks, all containing the standard and two of the remaining 6
treatments. Each of the latter therefore occurs twice. The total number of runs
required is 6 × 3 = 18, which is slightly less than the 21 for the balanced
incomplete blocks. The accuracy is very much less. There are three separate types
of comparison in this classical design:

(a) Between any treatment and the standard. These have variances 0.667 σ_k^2.

(b) Between any two treatments which occur in the same block. These have
variances 1.333 σ_k^2.

(c) Between any two treatments not in the same block. These have variances
approximately 4.0 σ_k^2.

The overall average is approximately 2.285 σ_k^2. The comparison of this classical
design with a "standard" with the balanced incomplete block is thus seen to be
very unfavourable to the former, when it is remembered that in the latter all
comparisons have the variance 0.857 σ_k^2.

In the same manner, it can be shown that one large factorial experiment is
much more satisfactory than two smaller ones. For example, with 6 factors all
at 2 levels, it is much better to have them in one full experiment requiring 2^6 = 64
runs(2) than to divide the experiment into two parts, each with 3 factors, requiring
for the two parts a total of 2 × 2^3 = 16 runs. The former requires appreciably
more work, but in return yields the following advantages:

(a)  The  main  effects  are  determined  with  8  times  the  accuracy. 

(b)  All  the  15  possible  first  order  interactions  are  evaluated  with  considerable 
accuracy  :  the  two  stage  experiment  only  estimates  6  of  these  15,  and 
then  only  with  one-eighth  the  accuracy. 

(c)  All  the  20  possible  second  order  interactions  are  estimated  :  in  the  two 
stage  experiment  only  2  are  estimated,  and  then  these  estimates  have  to 
be  used  as  error  so  are  of  no  value  as  regards  estimating  the  interactions 
themselves. 

(d)  All  the  15  possible  third  order  interactions  are  estimated.  There  are 
still  7  degrees  of  freedom  left  for  error :  if  we  use  these  plus  the  greater 
part  of  the  15  third  order  interactions  for  error  we  will  have  the  latter 
with  about  20  degrees  of  freedom.  This  is  much  more  satisfactory  than 
in  the  two-stage  experiment  where  we  start  off  with  a  second  order 
interaction  with  only  1  degree  of  freedom  as  error. 
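
The counts of interactions quoted in (a) to (d) follow from the number of ways of choosing the factors concerned, and can be verified as below. The function name is illustrative; order 0 is used here to denote the main effects.

```python
from math import comb

def effect_counts(n_factors):
    """Number of effects of each order in a 2^n factorial experiment.

    Order 0 denotes main effects, order 1 the first order (two-factor)
    interactions, and so on."""
    return {order: comb(n_factors, order + 1) for order in range(n_factors)}

full = effect_counts(6)     # one experiment with 6 factors
half = effect_counts(3)     # each of the two 3-factor experiments

print("6 factors:", full)
print("two 3-factor experiments:", {k: 2 * v for k, v in half.items()})

# Degrees of freedom remaining after main effects and interactions up to
# third order have been taken out of the 63 available.
left_for_error = (2 ** 6 - 1) - sum(full[order] for order in range(4))
print("left for error:", left_for_error)
```

The single experiment gives 15 first order, 20 second order and 15 third order interactions with 7 degrees of freedom over, against 6 first order and 2 second order interactions for the two smaller experiments together.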

(c)  Conclusions 

These  sections  on  the  design  of  experiments  have  been  written  rather  as 
though  it  were  a  fully-developed  subject  like  Euclid's  Geometry.  This  is  not  the 
case,  however :  these  statistical  designs  have  hitherto  only  been  employed  in  the 
chemical  industry  to  a  very  limited  extent,  their  development  taking  place  in  the 
science  of  agriculture,  where  there  is  now  a  body  of  experience  available  stretching 
over  twenty  years.  The  methods  are  clearly  capable  of  transplantation  on  the 
lines  indicated,  but  we  must  guard  against  an  undue  literalism  in  their  application. 
In  the  chemical  industry  certain  aspects  will  need  to  be  developed,  and  others 
perhaps  to  be  modified  as  not  suitable  in  the  changed  conditions.  Thus,  for 
example,  it  seems  that  for  the  presentation  of  final  results  on  which  action  may 
be  taken  the  5%  level  of  significance  is  inadequate,  and  we  really  require  the 
1%  or  better.  In  general,  therefore,  caution  should  be  used  in  their  interpretation 
till there is a greater foundation of experience, but to date no incident
has come to the author's notice which suggests that there are any pitfalls.

(2) Methods have been developed recently for factorial experiments with large numbers
of factors which allow us to obtain satisfactory estimates of the main effects and lower
order (first and second) interactions with only a fraction of the full number of runs; c.f.
D. J. Finney, Annals of Eugenics, Volume 12, page 291, 1945.

APPENDIX
TABLE I
Table of t

Degrees of
Freedom       0.10     0.05     0.02     0.01     0.001
   1          6.31    12.71    31.82    63.66    636.62
   2          2.92     4.30     6.97     9.93     31.60
   3          2.35     3.18     4.54     5.84     12.94
   4          2.13     2.78     3.75     4.60      8.61
   5          2.02     2.57     3.37     4.03      6.86
   6          1.94     2.45     3.14     3.71      5.96
   7          1.90     2.37     3.00     3.50      5.41
   8          1.86     2.31     2.90     3.36      5.04
   9          1.83     2.26     2.82     3.25      4.78
  10          1.81     2.23     2.76     3.17      4.59
  11          1.80     2.20     2.72     3.11      4.44
  12          1.78     2.18     2.68     3.06      4.32
  13          1.77     2.16     2.65     3.01      4.22
  14          1.76     2.15     2.62     2.98      4.14
  15          1.75     2.13     2.60     2.95      4.07
  16          1.75     2.12     2.58     2.92      4.02
  17          1.74     2.11     2.57     2.90      3.97
  18          1.73     2.10     2.55     2.88      3.92
  19          1.73     2.09     2.54     2.86      3.88
  20          1.73     2.09     2.53     2.85      3.85
  21          1.72     2.08     2.52     2.83      3.82
  22          1.72     2.07     2.51     2.82      3.79
  23          1.71     2.07     2.50     2.81      3.77
  24          1.71     2.06     2.49     2.80      3.75
  25          1.71     2.06     2.48     2.79      3.73
  26          1.71     2.06     2.48     2.78      3.71
  27          1.70     2.05     2.47     2.77      3.69
  28          1.70     2.05     2.47     2.76      3.67
  29          1.70     2.04     2.46     2.76      3.66
  30          1.70     2.04     2.46     2.75      3.65
  40          1.68     2.02     2.42     2.70      3.55
  60          1.67     2.00     2.39     2.66      3.46
 120          1.66     1.98     2.36     2.62      3.37
  ∞           1.65     1.96     2.33     2.58      3.29
Abridged  from  Table  III  of  "Statistical  Tables  for  Biological,  Agricultural  and 
Medical  Research."     (R.  A.  Fisher  and  F.  Yates  :   Oliver  and  Boyd). 
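
As a rough cross-check on how Table I is read, the sketch below integrates Student's t density numerically and confirms that a tabulated value leaves about the stated probability in the two tails combined. The function names and the Simpson's-rule step count are illustrative assumptions, not part of the original tables.

```python
import math

def t_density(t, df):
    """Probability density of Student's t with df degrees of freedom."""
    c = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return c * (1 + t * t / df) ** (-(df + 1) / 2)

def two_tailed_probability(t, df, steps=2000):
    """P(|T| > t), integrating the density over [0, t] by Simpson's rule.

    steps must be even for Simpson's rule to apply.
    """
    h = t / steps
    total = t_density(0.0, df) + t_density(t, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_density(i * h, df)
    return 1 - 2 * (h / 3) * total

# Entry for 10 degrees of freedom in the 0.05 column of Table I.
print(two_tailed_probability(2.23, 10))
```

The printed probability should lie close to 0.05; small differences arise because the tabulated t is rounded to two decimals.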
TABLE  II
Table  of  χ²

Degrees of
Freedom    0.99    0.98    0.95    0.90    0.50    0.10    0.05    0.02    0.01   0.001

      1   0.000   0.001   0.004   0.015   0.455    2.71    3.84    5.41    6.64   10.83
      2   0.020   0.040   0.103   0.211   1.386    4.61    5.99    7.82    9.21   13.82
      3   0.115   0.185   0.352   0.584   2.366    6.25    7.82    9.84   11.34   16.27
      4   0.297   0.429   0.711   1.064   3.357    7.78    9.49   11.67   13.28   18.47
      5   0.554   0.752   1.145   1.610   4.351    9.24   11.07   13.39   15.09   20.52
      6   0.872   1.134   1.635   2.204    5.35   10.65   12.59   15.03   16.81   22.46
      7   1.239   1.564   2.167   2.833    6.35   12.02   14.07   16.62   18.48   24.32
      8   1.646   2.032   2.733   3.490    7.34   13.36   15.51   18.17   20.09   26.13
      9   2.088   2.532   3.325   4.168    8.34   14.68   16.92   19.68   21.67   27.88
     10   2.558   3.059   3.940   4.865    9.34   15.99   18.31   21.16   23.21   29.59
     11    3.05    3.61    4.57    5.58   10.34   17.28   19.68   22.62   24.73   31.26
     12    3.57    4.18    5.23    6.30   11.34   18.55   21.03   24.05   26.22   32.91
     13    4.11    4.76    5.89    7.04   12.34   19.81   22.36   25.47   27.69   34.53
     14    4.66    5.37    6.57    7.79   13.34   21.06   23.69   26.87   29.14   36.12
     15    5.23    5.99    7.26    8.55   14.34   22.31   25.00   28.26   30.58   37.70
     16    5.81    6.61    7.96    9.31   15.34   23.54   26.30   29.63   32.00   39.25
     17    6.41    7.26    8.67   10.09   16.34   24.77   27.59   31.00   33.41   40.79
     18    7.02    7.91    9.39   10.87   17.34   25.99   28.87   32.35   34.81   42.31
     19    7.63    8.57   10.12   11.65   18.34   27.20   30.14   33.69   36.19   43.82
     20    8.26    9.24   10.85   12.44   19.34   28.41   31.41   35.02   37.57   45.32
     21    8.90    9.91   11.59   13.24   20.34   29.61   32.67   36.34   38.93   46.80
     22    9.54   10.60   12.34   14.04   21.34   30.81   33.92   37.66   40.29   48.27
     23   10.20   11.29   13.09   14.85   22.34   32.01   35.17   38.97   41.64   49.73
     24   10.86   11.99   13.85   15.66   23.34   33.20   36.42   40.27   42.98   51.18
     25   11.52   12.70   14.61   16.47   24.34   34.38   37.65   41.57   44.31   52.62
     26   12.20   13.41   15.38   17.29   25.34   35.56   38.89   42.86   45.64   54.05
     27   12.88   14.12   16.15   18.11   26.34   36.74   40.11   44.14   46.96   55.48
     28   13.56   14.85   16.93   18.94   27.34   37.92   41.34   45.42   48.28   56.89
     29   14.26   15.57   17.71   19.77   28.34   39.09   42.56   46.69   49.59   58.30
     30   14.95   16.31   18.49   20.60   29.34   40.26   43.77   47.96   50.89   59.70

Abridged  from  Table  IV  of  "Statistical  Tables  for  Biological,  Agricultural  and 
Medical  Research."     (R.  A.  Fisher  and  F.  Yates  :    Oliver  and  Boyd). 
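
The column headings of Table II are the probabilities that χ² exceeds the tabulated value. For an even number of degrees of freedom this upper-tail probability has a simple closed form, which the sketch below uses as a spot check. The function name is an illustrative assumption, and the closed form is standard theory rather than something stated in the tables themselves.

```python
import math

def chi2_upper_tail_even(x, df):
    """P(chi-square > x) for a positive, even number of degrees of freedom.

    Uses exp(-x/2) * sum_{j < df/2} (x/2)^j / j!, which holds only for even df.
    """
    if df <= 0 or df % 2:
        raise ValueError("this closed form needs a positive even df")
    half = x / 2
    term, total = 1.0, 1.0
    for j in range(1, df // 2):
        term *= half / j          # builds (x/2)^j / j! incrementally
        total += term
    return math.exp(-half) * total

# Entry for 10 degrees of freedom in the 0.05 column.
print(chi2_upper_tail_even(18.31, 10))
```

Odd degrees of freedom need the incomplete gamma function and are outside the scope of this sketch.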
TABLE  III
Tables  of  Variance  Ratio  (i)

0.20 Significance Level

N2 \ N1      1      2      3      4      5      6     12     24      ∞

      1    9.5   12.0   13.1   13.7   14.0   14.3   14.9   15.2   15.6
      2    3.6    4.0    4.2    4.2    4.3    4.3    4.4    4.4    4.5
      3    2.7    2.9    2.9    3.0    3.0    3.0    3.0    3.0    3.0
      4    2.4    2.5    2.5    2.5    2.5    2.5    2.5    2.4    2.4
      5    2.2    2.3    2.3    2.2    2.2    2.2    2.2    2.2    2.1
      6    2.1    2.1    2.1    2.1    2.1    2.1    2.0    2.0    2.0
      7    2.0    2.0    2.0    2.0    2.0    2.0    1.9    1.9    1.8
      8    2.0    2.0    2.0    1.9    1.9    1.9    1.8    1.8    1.7
      9    1.9    1.9    1.9    1.9    1.9    1.8    1.8    1.7    1.7
     10    1.9    1.9    1.9    1.8    1.8    1.8    1.7    1.7    1.6
     11    1.9    1.9    1.8    1.8    1.8    1.8    1.7    1.6    1.6
     12    1.8    1.8    1.8    1.8    1.7    1.7    1.7    1.6    1.5
     13    1.8    1.8    1.8    1.8    1.7    1.7    1.6    1.6    1.5
     14    1.8    1.8    1.8    1.7    1.7    1.7    1.6    1.6    1.5
     15    1.8    1.8    1.8    1.7    1.7    1.7    1.6    1.5    1.5
     16    1.8    1.8    1.7    1.7    1.7    1.6    1.6    1.5    1.4
     17    1.8    1.8    1.7    1.7    1.7    1.6    1.6    1.5    1.4
     18    1.8    1.8    1.7    1.7    1.6    1.6    1.5    1.5    1.4
     19    1.8    1.8    1.7    1.7    1.6    1.6    1.5    1.5    1.4
     20    1.8    1.8    1.7    1.7    1.6    1.6    1.5    1.5    1.4
     22    1.8    1.7    1.7    1.6    1.6    1.6    1.5    1.4    1.4
     24    1.7    1.7    1.7    1.6    1.6    1.6    1.5    1.4    1.3
     26    1.7    1.7    1.7    1.6    1.6    1.6    1.5    1.4    1.3
     28    1.7    1.7    1.7    1.6    1.6    1.6    1.5    1.4    1.3
     30    1.7    1.7    1.6    1.6    1.6    1.5    1.5    1.4    1.3
     40    1.7    1.7    1.6    1.6    1.5    1.5    1.4    1.4    1.2
     60    1.7    1.7    1.6    1.6    1.5    1.5    1.4    1.3    1.2
    120    1.7    1.6    1.6    1.5    1.5    1.5    1.4    1.3    1.1
      ∞    1.6    1.6    1.6    1.5    1.5    1.4    1.3    1.2    1.0

Abridged  from  Table  V  of  "Statistical  Tables  for  Biological,  Agricultural  and 
Medical  Research."     (R.  A.  Fisher  and  F.  Yates  :    Oliver  and  Boyd). 
Table  of  Variance  Ratio  (ii)


0.05 Significance Level

N2 \ N1      1      2      3      4      5      6     12     24      ∞

      1  161.4  199.5  215.7  224.6  230.2  234.0  243.9  249.0  254.3
      2   18.5   19.0   19.2   19.3   19.3   19.3   19.4   19.5   19.5
      3   10.1    9.6    9.3    9.1    9.0    8.9    8.7    8.6    8.5
      4    7.7    6.9    6.6    6.4    6.3    6.2    5.9    5.8    5.6
      5    6.6    5.8    5.4    5.2    5.1    5.0    4.7    4.5    4.4
      6    6.0    5.1    4.8    4.5    4.4    4.3    4.0    3.8    3.7
      7    5.6    4.7    4.4    4.1    4.0    3.9    3.6    3.4    3.2
      8    5.3    4.5    4.1    3.8    3.7    3.6    3.3    3.1    2.9
      9    5.1    4.3    3.9    3.6    3.5    3.4    3.1    2.9    2.7
     10    5.0    4.1    3.7    3.5    3.3    3.2    2.9    2.7    2.5
     11    4.8    4.0    3.6    3.4    3.2    3.1    2.8    2.6    2.4
     12    4.8    3.9    3.5    3.3    3.1    3.0    2.7    2.5    2.3
     13    4.7    3.8    3.4    3.2    3.0    2.9    2.6    2.4    2.2
     14    4.6    3.7    3.3    3.1    3.0    2.9    2.5    2.3    2.1
     15    4.5    3.7    3.3    3.1    2.9    2.8    2.5    2.3    2.1
     16    4.5    3.6    3.2    3.0    2.9    2.7    2.4    2.2    2.0
     17    4.5    3.6    3.2    3.0    2.8    2.7    2.4    2.2    2.0
     18    4.4    3.6    3.2    2.9    2.8    2.7    2.3    2.1    1.9
     19    4.4    3.5    3.1    2.9    2.7    2.6    2.3    2.1    1.9
     20    4.4    3.5    3.1    2.9    2.7    2.6    2.3    2.1    1.8
     22    4.3    3.4    3.1    2.8    2.7    2.6    2.2    2.0    1.8
     24    4.3    3.4    3.0    2.8    2.6    2.5    2.2    2.0    1.7
     26    4.2    3.4    3.0    2.7    2.6    2.5    2.2    2.0    1.7
     28    4.2    3.3    3.0    2.7    2.6    2.4    2.1    1.9    1.7
     30    4.2    3.3    2.9    2.7    2.5    2.4    2.1    1.9    1.6
     40    4.1    3.2    2.9    2.6    2.5    2.3    2.0    1.8    1.5
     60    4.0    3.2    2.8    2.5    2.4    2.3    1.9    1.7    1.4
    120    3.9    3.1    2.7    2.5    2.3    2.2    1.8    1.6    1.3
      ∞    3.8    3.0    2.6    2.4    2.2    2.1    1.8    1.5    1.0

Abridged  from  Table  V  of  "Statistical  Tables  for  Biological,  Agricultural  and
Medical  Research."     (R.  A.  Fisher  and  F.  Yates  :   Oliver  and  Boyd).
Tables  of  Variance  Ratio  (iii)


0.01 Significance Level

N2 \ N1      1      2      3      4      5      6      8     12     24      ∞

      1   4052   4999   5403   5625   5764   5859   5981   6106   6234   6366
      2   98.5   99.0   99.2   99.3   99.3   99.4   99.3   99.4   99.5   99.5
      3   34.1   30.8   29.5   28.7   28.2   27.9   27.5   27.1   26.6   26.1
      4   21.2   18.0   16.7   16.0   15.5   15.2   14.8   14.4   13.9   13.5
      5   16.3   13.3   12.1   11.4   11.0   10.7   10.3    9.9    9.5    9.0
      6   13.7   10.9    9.8    9.2    8.8    8.5    8.1    7.7    7.3    6.9
      7   12.3    9.6    8.5    7.9    7.5    7.2    6.8    6.5    6.1    5.7
      8   11.3    8.7    7.6    7.0    6.6    6.4    6.0    5.7    5.3    4.9
      9   10.6    8.0    7.0    6.4    6.1    5.8    5.5    5.1    4.7    4.3
     10   10.0    7.6    6.6    6.0    5.6    5.4    5.1    4.7    4.3    3.9
     11    9.7    7.2    6.2    5.7    5.3    5.1    4.7    4.4    4.0    3.6
     12    9.3    6.9    6.0    5.4    5.1    4.8    4.5    4.2    3.8    3.4
     13    9.1    6.7    5.7    5.2    4.9    4.6    4.3    4.0    3.6    3.2
     14    8.9    6.5    5.6    5.0    4.7    4.5    4.1    3.8    3.4    3.0
     15    8.7    6.4    5.4    4.9    4.6    4.3    4.0    3.7    3.3    2.9
     16    8.5    6.2    5.3    4.8    4.4    4.2    3.9    3.6    3.2    2.8
     17    8.4    6.1    5.2    4.7    4.3    4.1    3.8    3.5    3.1    2.7
     18    8.3    6.0    5.1    4.6    4.3    4.0    3.7    3.4    3.0    2.6
     19    8.2    5.9    5.0    4.5    4.2    3.9    3.6    3.3    2.9    2.5
     20    8.1    5.9    4.9    4.4    4.1    3.9    3.6    3.2    2.9    2.4
     22    7.9    5.7    4.8    4.3    4.0    3.8    3.5    3.1    2.8    2.3
     24    7.8    5.6    4.7    4.2    3.9    3.7    3.3    3.0    2.7    2.2
     26    7.7    5.5    4.6    4.1    3.8    3.6    3.3    3.0    2.6    2.1
     28    7.6    5.5    4.6    4.1    3.8    3.5    3.2    2.9    2.5    2.1
     30    7.6    5.4    4.5    4.0    3.7    3.5    3.2    2.8    2.5    2.0
     40    7.3    5.2    4.3    3.8    3.5    3.3    3.0    2.7    2.3    1.8
     60    7.1    5.0    4.1    3.7    3.3    3.1    2.8    2.5    2.1    1.6
    120    6.9    4.8    4.0    3.5    3.2    3.0    2.7    2.3    2.0    1.4
      ∞    6.6    4.6    3.8    3.3    3.0    2.8    2.5    2.2    1.8    1.0

Abridged  from  Table  V  of  "Statistical  Tables  for  Biological,  Agricultural  and
Medical  Research."     (R.  A.  Fisher  and  F.  Yates  :   Oliver  and  Boyd).
Table  of  Variance  Ratio  (iv)


0.001 Significance Level

N2 \ N1      1      2      3      4      5      6      8     12     24      ∞

      1   varying from 400,000 to 500,000
      2    998    999    999    999    999    999    999    999    999    999
      3    167    148    141    137    135    133    131    128    126    123
      4   74.1   61.3   56.2   53.4   51.7   50.5   49.0   47.4   45.8   44.1
      5   47.0   36.6   33.2   31.1   29.8   28.8   27.6   26.4   25.1   23.8
      6   35.5   27.0   23.7   21.9   20.8   20.0   19.0   18.0   16.9   15.8
      7   29.2   21.7   18.8   17.2   16.2   15.5   14.6   13.7   12.7   11.7
      8   25.4   18.5   15.8   14.4   13.5   12.9   12.0   11.2   10.3    9.3
      9   22.9   16.4   13.9   12.6   11.7   11.1   10.4    9.6    8.7    7.8
     10   21.0   14.9   12.6   11.3   10.5    9.9    9.2    8.5    7.6    6.8
     11   19.7   13.8   11.6   10.4    9.6    9.1    8.3    7.6    6.9    6.0
     12   18.6   13.0   10.8    9.6    8.9    8.4    7.7    7.0    6.3    5.4
     13   17.8   12.3   10.2    9.1    8.4    7.9    7.2    6.5    5.8    5.0
     14   17.1   11.8    9.7    8.6    7.9    7.4    6.8    6.1    5.4    4.6
     15   16.6   11.3    9.3    8.3    7.6    7.1    6.5    5.8    5.1    4.3
     16   16.1   11.0    9.0    7.9    7.3    6.8    6.2    5.6    4.9    4.1
     17   15.7   10.7    8.7    7.7    7.0    6.6    6.0    5.3    4.6    3.9
     18   15.4   10.4    8.5    7.5    6.8    6.4    5.8    5.1    4.5    3.7
     19   15.1   10.2    8.3    7.3    6.6    6.2    5.6    5.0    4.3    3.5
     20   14.8   10.0    8.1    7.1    6.5    6.0    5.4    4.8    4.2    3.4
     22   14.4    9.6    7.8    6.8    6.2    5.8    5.2    4.6    3.9    3.2
     24   14.0    9.3    7.6    6.6    6.0    5.6    5.0    4.4    3.7    3.0
     26   13.7    9.1    7.4    6.4    5.8    5.4    4.8    4.2    3.6    2.8
     28   13.5    8.9    7.2    6.3    5.7    5.2    4.7    4.1    3.5    2.7
     30   13.3    8.8    7.1    6.1    5.5    5.1    4.6    4.0    3.4    2.6
     40   12.6    8.2    6.6    5.7    5.1    4.7    4.2    3.6    3.0    2.2
     60   12.0    7.8    6.2    5.3    4.8    4.4    3.9    3.3    2.7    1.9
    120   11.4    7.3    5.8    5.0    4.4    4.0    3.5    3.0    2.4    1.6
      ∞   10.8    6.9    5.4    4.6    4.1    3.7    3.3    2.7    2.1    1.0

Abridged  from  Table  V  of  "Statistical  Tables  for  Biological,  Agricultural  and 
Medical  Research."     (R.  A.  Fisher  and  F.  Yates  :    Oliver  and  Boyd). 
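
Two standard relations tie the variance ratio tables to Tables I and II, and they make a convenient check on any single entry: with N1 = 1 the variance ratio equals t² for N2 degrees of freedom, and with N2 = ∞ it equals χ²/N1 for N1 degrees of freedom. The sketch below compares a few 0.05-level entries copied from the three tables. The dictionary and function names are illustrative assumptions, and the relations themselves are standard theory rather than statements made in the tables.

```python
# Entries copied from the 0.05 columns of Tables I, II and III (ii).
t_005 = {5: 2.57, 10: 2.23, 20: 2.09, 60: 2.00}           # keyed by N2
f_005_n1_is_1 = {5: 6.6, 10: 5.0, 20: 4.4, 60: 4.0}       # keyed by N2
chi2_005 = {2: 5.99, 3: 7.82, 4: 9.49, 6: 12.59, 12: 21.03, 24: 36.42}  # keyed by N1
f_005_n2_infinite = {2: 3.0, 3: 2.6, 4: 2.4, 6: 2.1, 12: 1.8, 24: 1.5}  # keyed by N1

def max_gap_t_squared():
    """Largest difference between t squared and the N1 = 1 variance ratio."""
    return max(abs(t_005[n2] ** 2 - f_005_n1_is_1[n2]) for n2 in t_005)

def max_gap_chi2_over_n1():
    """Largest difference between chi-square / N1 and the N2 = infinity variance ratio."""
    return max(abs(chi2_005[n1] / n1 - f_005_n2_infinite[n1]) for n1 in chi2_005)

print(max_gap_t_squared(), max_gap_chi2_over_n1())
```

Both gaps should stay within the rounding of the one-decimal variance ratio entries.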
TABLE  IV
Table  of  the  Correlation  Coefficient 


Degrees of                     r
Freedom    0.10    0.05    0.02    0.01   0.001

      1    .988    .997    .999   1.000   1.000
      2    .900    .950    .980    .990    .999
      3    .805    .878    .934    .959    .992
      4    .729    .811    .882    .917    .974
      5    .669    .754    .833    .874    .951
      6    .621    .707    .789    .834    .925
      7    .582    .666    .750    .798    .898
      8    .549    .632    .716    .765    .872
      9    .521    .602    .685    .735    .847
     10    .497    .576    .658    .708    .823
     11    .476    .553    .634    .684    .801
     12    .457    .532    .612    .661    .780
     13    .441    .514    .592    .641    .760
     14    .426    .497    .574    .623    .742
     15    .412    .482    .558    .606    .725
     16    .400    .468    .543    .590    .708
     17    .389    .456    .528    .575    .693
     18    .378    .444    .516    .561    .679
     19    .369    .433    .503    .549    .665
     20    .360    .423    .492    .537    .652
     25    .323    .381    .445    .487    .597
     30    .296    .349    .409    .449    .554
     35    .275    .325    .381    .418    .519
     40    .257    .304    .358    .393    .490
     45    .243    .287    .338    .372    .465
     50    .231    .273    .322    .354    .443
     60    .211    .250    .295    .325    .408
     70    .195    .232    .274    .302    .380
     80    .183    .217    .256    .283    .357
     90    .173    .205    .242    .267    .337
    100    .164    .195    .230    .254    .321

Abridged  from  Table  VI  of  "Statistical  Tables  for  Biological,  Agricultural  and 
Medical  Research."     (R.  A.  Fisher  and  F.  Yates  :   Oliver  and  Boyd). 
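
The entries of Table IV can be recovered from Table I through the standard relation r = t / √(t² + ν), where ν is the number of degrees of freedom. The sketch below applies it to a few values of t copied from Table I. The function name is an illustrative assumption, and the relation is standard theory rather than something stated in the table.

```python
import math

def critical_r(t, df):
    """Critical correlation coefficient implied by a critical t at df degrees of freedom."""
    return t / math.sqrt(t * t + df)

# t for 10 degrees of freedom at the 0.05 level, copied from Table I.
print(round(critical_r(2.23, 10), 3))
```

Because the tabulated t is rounded to two decimals, agreement with Table IV is to about the third decimal place only.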
TABLE  V
Factors  for  Control  Charts 


Number in       A'       A'       D'       D'      d_n
Sample       0.025    0.001    0.975    0.999

        2    1.229    1.937     2.81     4.12     1.13
        3    0.668    1.054     2.17     2.98     1.69
        4    0.476    0.750     1.93     2.57     2.06
        5    0.377    0.594     1.81     2.34     2.33
        6    0.316    0.498     1.72     2.21     2.53
        7    0.274    0.432     1.66     2.11     2.70
        8    0.244    0.384     1.62     2.04     2.85
        9    0.220    0.347     1.58     1.99     2.97
       10    0.202    0.317     1.56     1.93     3.08

Abridged  from  B.  S.  600R  "Quality  Control  Charts".     (B.  P.  Dudding  and 
W.  J.  Jennett.     British  Standards  Institution.) 
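
The A' columns of Table V agree numerically with z / (d_n √n), where z is the normal deviate cutting off the stated single-tail probability (1.96 for 0.025 and 3.09 for 0.001). The sketch below recomputes the factors from the d_n column on that assumption. The names `D_N` and `a_factor` are illustrative, and the formula is inferred from the tabulated figures, not quoted from B.S. 600R.

```python
import math

# d_n column of Table V, keyed by the number n in the sample.
D_N = {2: 1.13, 3: 1.69, 4: 2.06, 5: 2.33, 6: 2.53,
       7: 2.70, 8: 2.85, 9: 2.97, 10: 3.08}

def a_factor(n, z):
    """Factor A' for samples of n, assuming A' = z / (d_n * sqrt(n))."""
    return z / (D_N[n] * math.sqrt(n))

for n in sorted(D_N):
    # Columns: n, A' at 0.025, A' at 0.001.
    print(n, round(a_factor(n, 1.96), 3), round(a_factor(n, 3.09), 3))
```

Since d_n is tabulated to only two decimals, the recomputed factors differ from the table in the third decimal place.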


TABLE  VI 
The  Angular  Transformation  of  Percentages  to  Degrees 


 p%      0      1      2      3      4      5      6      7      8      9

  0      0    5.7    8.1   10.0   11.5   12.9   14.2   15.3   16.4   17.5
 10   18.4   19.4   20.3   21.1   22.0   22.8   23.6   24.4   25.1   25.8
 20   26.6   27.3   28.0   28.7   29.3   30.0   30.7   31.3   31.9   32.6
 30   33.2   33.8   34.4   35.1   35.7   36.3   36.9   37.5   38.1   38.6
 40   39.2   39.8   40.4   41.0   41.6   42.1   42.7   43.3   43.9   44.4
 50   45.0   45.6   46.1   46.7   47.3   47.9   48.4   49.0   49.6   50.2
 60   50.8   51.4   51.9   52.5   53.1   53.7   54.3   54.9   55.6   56.2
 70   56.8   57.4   58.1   58.7   59.3   60.0   60.7   61.3   62.0   62.7
 80   63.4   64.2   64.9   65.6   66.4   67.2   68.0   68.9   69.7   70.6
 90   71.6   72.5   73.6   74.7   75.8   77.1   78.5   80.0   81.9   84.3

From  Table  XII  of  "Statistical  Tables  for  Biological,  Agricultural  and  Medical  Research"
(R.  A.  Fisher  and  F.  Yates  :   Oliver  and  Boyd).
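
The angular transformation tabulated in Table VI is the angle, in degrees, whose sine is the square root of the proportion p/100. A minimal sketch for values that fall between the tabulated percentages follows; the function name is an illustrative assumption.

```python
import math

def angular_transformation(percent):
    """Angle in degrees whose sine is sqrt(percent / 100), for 0 <= percent <= 100."""
    if not 0 <= percent <= 100:
        raise ValueError("percent must lie between 0 and 100")
    return math.degrees(math.asin(math.sqrt(percent / 100)))

# A percentage between two tabulated entries.
print(round(angular_transformation(12.5), 1))
```

The transformation is symmetric about 50 per cent: the angles for p and for 100 - p add up to 90 degrees, which gives a quick check on any pair of entries.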
TABLE  VII
Random  Numbers  (I) 


03 47 43 73 86   36 96 47 36 61   46 98 63 71 62   33 26 16 80 45   60 11 14 10 95
97 74 24 67 62   42 81 14 57 20   42 53 32 37 32   27 07 36 07 51   24 51 79 89 73
16 76 62 27 66   56 50 26 71 07   32 90 79 78 53   13 55 38 58 59   88 97 54 14 10
12 56 85 99 26   96 96 68 27 31   05 03 72 93 15   57 12 10 14 21   88 26 49 81 76
55 59 56 35 64   38 54 82 46 22   31 62 43 09 90   06 18 44 32 53   23 83 01 30 30

16 22 77 94 39   49 54 43 54 82   17 37 93 23 78   87 35 20 96 43   84 26 34 91 64
84 42 17 53 31   57 24 55 06 88   77 04 74 47 67   21 76 33 50 25   83 92 12 06 76
63 01 63 78 59   16 95 55 67 19   98 10 50 71 75   12 86 73 58 07   44 39 52 38 79
33 21 12 34 29   78 64 56 07 82   52 42 07 44 38   15 51 00 13 42   99 66 02 79 54
57 60 86 32 44   09 47 27 96 54   49 17 46 09 62   90 52 84 77 27   08 02 73 43 28

18 18 07 92 46   44 17 16 58 09   79 83 86 19 62   06 76 50 03 10   55 23 64 05 05
26 62 38 97 75   84 16 07 44 99   83 11 46 32 24   20 14 85 88 45   10 93 72 88 71
23 42 40 64 74   82 97 77 77 81   07 45 32 14 08   32 98 94 07 72   93 85 79 10 75
52 36 28 19 95   50 92 26 11 97   00 56 76 31 38   80 22 02 53 53   86 60 42 04 53
37 85 94 35 12   83 39 50 08 30   42 34 07 96 88   54 42 06 87 98   35 85 29 48 39

70 29 17 12 13   40 33 20 38 26   13 89 51 03 74   17 76 37 13 04   07 74 21 19 30
56 62 18 37 35   96 83 50 87 75   97 12 25 93 47   70 33 24 03 54   97 77 46 44 80
99 49 57 22 77   88 42 95 45 72   16 64 36 16 00   04 43 18 66 79   94 77 24 21 90
16 08 15 04 72   33 27 14 34 90   45 59 34 68 49   12 72 07 34 45   99 27 72 95 14
31 16 93 32 43   50 27 89 87 19   20 15 37 00 49   52 85 66 60 44   38 68 88 11 80

68 34 30 13 70   55 74 30 77 40   44 22 78 84 26   04 33 46 09 52   68 07 97 06 57
74 57 25 65 76   59 29 97 68 60   71 91 38 67 54   13 58 18 24 76   15 54 55 95 52
27 42 37 86 53   48 55 90 65 72   96 57 69 36 10   96 46 92 42 45   97 60 49 04 91
00 39 68 29 61   66 37 32 20 30   77 84 57 03 29   10 45 65 04 26   11 04 96 67 24
29 94 98 94 24   68 49 69 10 82   53 75 91 93 30   34 25 20 57 27   40 48 73 51 92

16 90 82 66 59   83 62 64 11 12   67 19 00 71 74   60 47 21 29 68   02 02 37 03 31
11 27 94 75 06   06 09 19 74 66   02 94 37 34 02   76 70 90 30 86   38 45 94 30 38
35 24 10 16 20   33 32 51 26 38   79 78 45 04 91   16 92 53 56 16   02 75 50 95 98
38 23 16 86 38   42 38 97 01 50   87 75 66 81 41   40 01 74 91 62   48 51 84 08 32
31 96 25 91 47   96 44 33 49 13   34 86 82 53 91   00 52 43 48 85   27 55 26 89 62

66 67 40 67 14   64 05 71 95 86   11 05 65 09 68   76 83 20 37 90   57 16 00 11 66
14 90 84 45 11   75 73 88 05 90   52 27 41 14 86   22 98 12 22 08   07 52 74 95 80
68 05 51 18 00   33 96 02 75 19   07 60 62 93 55   59 33 82 43 90   49 37 38 44 59
20 46 78 73 90   97 51 40 14 02   04 02 33 31 08   39 54 16 49 36   47 95 93 13 30
64 19 58 97 79   15 06 15 93 20   01 90 10 75 06   40 78 78 89 62   02 67 74 17 33

05 26 93 70 60   22 35 85 15 13   92 03 51 59 77   59 56 78 06 83   52 91 05 70 74
07 97 10 88 23   09 98 42 99 64   61 71 62 99 15   06 51 29 16 93   58 05 77 09 51
68 71 86 85 85   54 87 66 47 54   73 32 08 11 12   44 95 92 63 16   29 56 24 29 48
26 99 61 65 53   58 37 78 80 70   42 10 50 67 42   32 17 55 85 74   94 44 67 16 94
14 65 52 68 75   87 59 36 22 41   26 78 63 06 55   13 08 27 01 50   15 29 39 39 43

17 53 77 58 71   71 41 61 50 72   12 41 94 96 26   44 95 27 36 99   02 96 74 30 83
90 26 59 21 19   23 52 23 33 12   96 93 02 18 39   07 02 18 36 07   25 99 32 70 23
41 23 52 55 99   31 04 49 69 96   10 47 48 45 88   13 41 43 89 20   97 17 14 49 17
60 20 50 81 69   31 99 73 68 68   35 81 33 03 76   24 30 12 48 60   18 99 10 72 34
91 25 38 05 90   94 58 28 41 36   45 37 59 03 09   90 35 57 29 12   82 62 54 65 60

34 50 57 74 37   98 80 33 00 91   09 77 93 19 82   74 94 80 04 04   45 07 31 66 49
85 22 04 39 43   73 81 53 94 79   33 62 46 86 28   08 31 54 46 31   53 94 13 38 47
09 79 13 77 48   73 82 97 22 21   05 03 27 24 83   72 89 44 05 60   35 80 39 94 88
88 75 80 18 14   22 95 75 42 49   39 32 82 22 49   02 48 07 70 37   16 04 61 67 87
90 96 23 70 00   39 00 03 06 90   55 85 78 38 36   94 37 30 69 32   90 89 00 76 33

Abridged  from  Table  XXXIII  of  "Statistical  Tables  for  Biological,  Agricultural,  and  Medical 
Research"  (R.  A.  Fisher  and  F.  Yates  :  Oliver  and  Boyd). 
TABLE  VII
Random  Numbers  (II) 


53 74 23 99 67   61 32 28 69 84   94 62 67 86 24   98 33 41 19 95   47 53 53 38 09
63 38 06 86 54   99 00 65 26 94   02 82 90 23 07   79 62 67 80 60   75 91 12 81 19
35 30 58 21 46   06 72 17 10 94   25 21 31 75 96   49 28 24 00 49   55 65 79 78 07
63 43 36 82 69   65 51 18 37 88   61 38 44 12 45   32 92 85 88 65   54 34 81 85 35
98 25 37 55 26   01 91 82 81 46   74 71 12 94 97   24 02 71 37 07   03 92 18 66 75

02 63 21 17 69   71 50 80 89 56   38 15 70 11 48   43 40 45 86 98   00 83 26 91 03
64 55 22 21 82   48 22 28 06 00   61 54 13 43 91   82 78 12 23 29   06 66 24 12 27
85 07 26 13 89   01 10 07 82 04   59 63 69 36 03   69 11 15 83 80   13 29 54 19 28
58 54 16 24 15   51 54 44 82 00   62 61 65 04 69   38 18 65 18 97   85 72 13 49 21
34 85 27 84 87   61 48 64 56 26   90 18 48 13 26   37 70 15 42 57   65 65 80 39 07

03 92 18 27 46   57 99 16 96 56   30 33 72 85 22   84 64 38 56 98   99 01 30 98 64
62 95 30 27 59   37 75 41 66 48   86 97 80 61 45   23 53 04 01 63   45 76 08 64 27
08 45 93 15 22   60 21 75 46 91   98 77 27 85 42   28 88 61 08 84   69 62 03 42 73
07 08 55 18 40   45 44 75 13 90   24 94 96 61 02   57 55 66 83 15   73 42 37 11 61
01 85 89 95 66   51 10 19 34 88   15 84 97 19 75   12 76 39 43 78   64 63 91 08 25

72 84 71 14 35   19 11 58 49 26   50 11 17 17 76   86 31 57 20 18   95 60 78 46 75
88 78 28 16 84   13 52 53 94 53   75 45 69 30 96   73 89 65 70 31   99 17 43 48 76
45 17 75 65 57   28 40 19 72 12   25 12 74 75 67   60 40 60 81 19   24 62 01 61 16
96 76 28 12 54   22 01 11 94 25   71 96 16 16 88   68 64 36 74 45   19 59 50 88 92
43 31 67 72 30   24 02 94 08 63   38 32 36 66 02   69 36 38 25 39   48 03 45 15 22

50 44 66 44 21   66 06 58 05 62   68 15 54 35 02   42 35 48 96 32   14 52 41 52 48
22 66 22 15 86   26 63 75 41 99   58 42 36 72 24   58 37 52 18 51   03 37 18 39 11
96 24 40 14 51   23 22 30 88 57   95 67 47 29 83   94 69 40 06 07   18 16 36 78 86
31 73 91 61 19   60 20 72 93 48   98 57 07 23 69   65 95 39 69 58   56 80 30 19 44
78 60 73 99 84   43 89 94 36 45   56 69 47 07 41   90 22 91 07 12   78 35 34 08 72

84 37 90 61 56   70 10 23 98 05   85 11 34 76 60   76 48 45 34 60   01 64 18 39 96
36 67 10 08 23   98 93 35 08 86   99 29 76 29 81   33 34 91 58 93   63 14 52 32 52
07 28 59 07 48   89 64 58 89 75   83 85 62 27 89   30 14 78 56 27   86 63 59 80 02
10 15 83 87 60   79 24 31 66 56   21 48 24 06 93   91 98 94 05 49   01 47 59 38 00
55 19 68 97 65   03 73 52 16 56   00 53 55 90 27   33 42 29 38 87   22 13 88 83 34

53 81 29 13 39   35 01 20 71 34   62 33 74 82 14   53 73 19 09 03   56 54 29 56 93
51 86 32 68 92   33 98 74 66 99   40 14 71 94 58   45 94 19 38 81   14 44 99 81 07
35 91 70 29 13   80 03 54 07 27   96 94 78 32 66   50 95 52 74 33   13 80 55 62 54
37 71 67 95 13   20 02 44 95 94   64 85 04 05 72   01 32 90 76 14   53 89 74 60 41
93 66 13 83 27   92 79 64 64 72   28 54 96 53 84   48 14 52 98 94   56 07 93 89 30

02 96 08 45 65   13 05 00 41 84   93 07 54 72 59   21 45 57 09 77   19 48 56 27 44
49 83 43 48 35   82 88 33 69 96   72 36 04 19 76   47 45 15 18 60   82 11 08 95 97
84 60 71 62 46   40 80 81 30 37   34 39 23 05 38   25 15 35 71 30   88 12 57 21 77
18 17 30 88 71   44 91 14 88 47   89 23 30 63 15   56 34 20 47 89   99 82 93 24 98
79 69 10 61 78   71 32 76 95 62   87 00 22 58 40   92 54 01 75 25   43 11 71 99 31

75 93 36 57 83   56 20 14 82 11   74 21 97 90 65   96 42 68 63 86   74 54 13 26 94
38 30 92 29 03   06 28 81 39 38   62 25 06 84 63   61 29 08 93 67   04 32 92 08 09
51 29 50 10 34   31 57 75 95 80   51 97 02 74 77   76 15 48 49 44   18 55 63 77 09
21 31 38 86 24   37 79 81 53 74   73 24 16 10 33   52 83 90 94 76   70 47 14 54 36
29 01 23 87 88   58 02 39 37 67   42 10 14 20 92   16 55 23 42 45   54 96 09 11 06

95 33 95 22 00   18 74 72 00 18   38 79 58 69 32   81 76 80 26 92   82 80 84 25 39
90 84 60 79 80   24 36 59 87 38   82 07 53 89 35   96 35 23 79 18   05 98 90 07 35
46 40 62 98 82   54 97 20 56 95   15 74 80 08 32   16 46 70 50 80   67 72 16 42 79
20 31 89 03 43   38 46 82 68 72   32 14 82 99 70   80 60 47 18 97   63 49 30 21 30
71 59 73 05 50   08 22 23 71 77   91 01 93 20 49   82 96 59 26 94   66 39 67 98 60

Abridged  from  Table  XXXIII  of  "Statistical  Tables  for  Biological,  Agricultural  and  Medical
Research"  (R.  A.  Fisher  and  F.  Yates  :    Oliver  and  Boyd).